Description

The present invention relates to a colony stimulating factor (hereinafter "CSF") and, more particularly,
to the cloning of the gene for the human granulocyte-macrophage colony stimulating factor (hereinafter
"GM-CSF") by use of a nucleotide probe derived from a murine GM-CSF complementary deoxyribonucleic
acid ("cDNA") clone to screen a cDNA library synthesized from messenger ribonucleic acid ("mRNA") 
containing human GM-CSF mRNA, and to the characterization of the GM-CSF gene. 

CSF refers to a family of lymphokines which induce progenitor cells found in the bone marrow to 
differentiate into specific types of mature blood cells. The particular type of mature blood cell that results 
from a progenitor cell depends upon the type of CSF present. For instance, erythropoietin is believed to
cause progenitor cells to mature into erythrocytes while thrombopoietin is thought to drive progenitor cells 
along the thrombocytic pathway. Similarly, granulocyte-macrophage colony formation is dependent on the 
presence of GM-CSF. The present invention concerns the cloning of human GM-CSF. 

CSF, including human GM-CSF, is produced only in minute quantities in vivo. CSF-like factors have
been extracted from body organs, Sheridan and Stanley, 78 J. Cell. Physiol. 451-459 (1971), and have been
detected in serum and urine, Robinson et al., 69 J. Cell. Physiol. 83-92 (1967); Stanley et al., 79 J. Lab.
Clin. Med. 657-668 (1972). Researchers have reported isolating low titer CSF-like factor from human
peripheral blood cells which appear to be macrophages or monocytes, Moore and Williams, 80 J. Cell.
Physiol. 195-206 (1972); Golde and Kline, 51 J. Clin. Invest. 2981-2983 (1972); Moore et al., 50 J. Natl.
Cancer Inst. 591-601 (1973).

Although the factors identified by the above researchers have been reported to be CSF, heretofore 
sufficient quantities of homogeneous human CSF, including GM-CSF, have not been available to thoroughly 
investigate its biochemistry and biology. The availability of adequate quantities of homogeneous human 
GM-CSF would be valuable in investigations and possible treatment of proliferative blood disorders, such as 
certain leukemias and anemias. Also, human GM-CSF in greater purity and larger quantities than heretofore
available could prove useful in achieving successful bone marrow transplantation following cancer
chemotherapy.

One potential method of providing relatively large quantities of homogeneous human GM-CSF is 
through recombinant DNA techniques. Recombinant DNA techniques have been developed for economically 
producing a desired protein once the gene coding for the protein has been isolated and identified. A
discussion of such recombinant DNA techniques for protein production is set forth in the editorial and 
supporting papers in Vol. 196 of Science (April 1977). However, to take advantage of the recombinant DNA 
techniques discussed in this reference, the gene coding for human GM-CSF must first be isolated. 

In accordance with the present invention, the gene coding for human GM-CSF is isolated from a cDNA 
library with a nick-translated cDNA probe. The probe is isolated from a murine GM-CSF cDNA library by
use of a synthetic oligonucleotide probe corresponding to a portion of the nucleotide sequence of murine 
GM-CSF. Total human RNA is extracted from cell lines or other sources thought to produce relatively high 
levels of GM-CSF. Polyadenylated mRNA is isolated from the total RNA extract. A cDNA library is 
constructed by reverse transcription of the polyadenylated mRNA with reverse transcriptase. The DNA is
rendered double-stranded with DNA polymerase I and inserted into an appropriate cloning vector. Resultant
recombinant cloning vectors are used to transform an appropriate host. 

Transformed hosts are identified and grouped into pools. Plasmid DNA prepared from these pools is 
hybridized with the murine cDNA probe that has been radiolabeled. Each pool of clones that gives a
positive signal with the probe is identified, the putative pool is subdivided, and the hybridization screen
is repeated. A single transformant corresponding to the human GM-CSF gene is eventually identified. Plasmid
DNA is prepared from this transformant and characterized by DNA sequencing. In addition, the corresponding
amino acid sequence is determined from the nucleotide sequence. The coding region of the human
GM-CSF gene is cloned in a yeast host system to express mature GM-CSF. Thereafter, biological assays
are conducted to confirm that the expressed protein product is GM-CSF.
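
As a minimal illustration of how an amino acid sequence can be read off a nucleotide sequence, the following Python sketch translates a coding strand using the standard genetic code. The function name `translate` and the short test fragment are hypothetical; the fragment is not taken from FIGURE 1 or FIGURE 2.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(coding_strand):
    """Translate a 5'->3' coding (sense) strand, stopping at the first stop codon."""
    seq = coding_strand.upper().replace("U", "T")
    protein = []
    # Ignore any trailing one or two bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# Hypothetical fragment (not from the figures): Met-Ala-Pro-Glu followed by a stop codon.
print(translate("ATGGCACCAGAGTGA"))
```

The one-letter codes returned here correspond to the three-letter residue names used in the figures (for example, A for Ala and E for Glu).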
The details of typical embodiments of the present invention will be described in connection with the
accompanying drawings, in which:

FIGURE 1 illustrates the amino acid and nucleotide sequence of the gene coding for murine GM-CSF 
with the portion of such gene employed as a probe for screening a human cDNA library indicated in 
solid underline; 

FIGURE 2 illustrates the amino acid and nucleotide sequence of the human GM-CSF gene, including the
3' noncoding region;

FIGURE 3 illustrates the pYαfGM-2 expression plasmid with the coding region of the GM-CSF gene
inserted therein for use in transforming host cells to express functional human GM-CSF;

FIGURE 4 illustrates Northern blot analysis of GM-CSF mRNA from various cell sources; and
FIGURE 5 illustrates Southern blot analysis of human genomic DNA. 

Sources of Human CSF Producing Cells 

Preferably, a cDNA library, from which the gene coding for human GM-CSF will be sought, is
constructed from cells previously found to produce relatively high levels of other lymphokines, under the
assumption that they might also produce human GM-CSF. These sources may include malignant cell lines,
such as a human lymphoma T-cell line. Applicants have prepared cDNA libraries from several human
lymphoma T-cell lines, such as HUT-102 and Jurkat. These particular cell lines are available from a wide
variety of sources and have been used extensively by researchers. See, for instance, Leonard et al., 80
Proc. Natl. Acad. Sci. U.S.A. 6957 (1983), and Leonard et al., 300 Nature (London) 267 (November 1982).

Activated human peripheral blood mononuclear cells also potentially may be a source of GM-CSF 
molecules. For use in the present invention, the peripheral blood mononuclear cells can be separated from
whole blood by standard techniques, such as by Ficoll-Hypaque centrifugation. Adherent cells are removed
by plastic adherence and the remaining leukocytes are multiplied by culturing in vitro in a serum-containing
medium together with a T-cell mitogen.

Applicants have investigated a number of human cell lines, including those noted above, and peripheral 
blood T-cells as sources for the gene coding for human GM-CSF. As set forth infra, applicants have
successfully isolated this gene from cDNA libraries prepared from the HUT-102 cell line and from
mitogen-activated, nonadherent leukocytes.

Preparation of RNA from Human CSF Producing Cells 

Total RNA from potentially GM-CSF-producing human cells is extracted by standard methods, such as those
disclosed by Chirgwin et al., 18 Biochemistry 5294 (1979), and Maniatis et al., Molecular Cloning, a
Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York (1982).

As is well known, when extracting RNA from cells, it is important to minimize ribonuclease ("RNase") 
activity during the initial stages of extraction. One manner in which this is accomplished is to denature the
cellular protein, including the RNase, at a rate that exceeds the rate of RNA hydrolysis by RNase. In the
procedures of Chirgwin et al., supra, and Maniatis et al., supra at 196, this is carried out by use of
guanidinium thiocyanate, together with a reducing agent, such as 2-mercaptoethanol (to break up the 
protein disulfide bonds). The RNA is isolated from the protein by standard techniques, such as 
phenol/chloroform extraction, ethanol precipitation or sedimentation through cesium chloride. 

Next, polyadenylated mRNA is separated from the extracted protein. Although several techniques have
been developed to carry out this separation process, one preferred method is to chromatograph the
polyadenylated mRNA on oligo (dT)-cellulose as described by Edmonds et al., 68 Proc. Natl. Acad. Sci.
1336 (1971); Aviv and Leder, 69 Proc. Natl. Acad. Sci. 1408 (1972); and Maniatis et al., supra at 197. The
oligo (dT)-cellulose column is prepared with a loading buffer and then the mRNA is applied to the column.
Thereafter, the column is initially washed with a buffer solution to remove the unpolyadenylated mRNA and
then the polyadenylated mRNA is eluted from the column with a buffered, low ionic strength eluent. The
integrity of the polyadenylated mRNA is verified by gel electrophoresis. 

Preparation of cDNA from mRNA 

A library of double-stranded cDNA corresponding to the total mRNA, as prepared above, is constructed
by known techniques employing the enzyme reverse transcriptase. One such procedure which may be
employed in conjunction with the present invention is detailed by Maniatis et al., supra at 230. Briefly, the
polyadenylated mRNA is reverse transcribed by using oligo-dT, that has been hybridized to the
polyadenylated tail of the mRNA, as a primer for a first cDNA strand. This results in a "hairpin" loop at the
3' end of the initial cDNA strand that serves as an integral primer for the second DNA strand. Next, the
second cDNA strand is synthesized using the enzyme DNA polymerase I and the hairpin loop is cleaved by
S1 nuclease to produce double-stranded cDNA molecules. The double-stranded cDNA is fractionated by
any convenient means to remove the shorter strands, thereby avoiding the needless cloning of small cDNA
fractions.

It is to be understood that in accordance with the present invention, alternative standard procedures 
may be employed to prepare double-stranded cDNA from mRNA. One such alternative technique is 
disclosed by Land et al., 9 Nucl. Acids Res. 2251 (1981). In the Land et al. protocol, the hairpin loop is not
used as a primer for the second cDNA strand. Rather, the 3' end of the first cDNA strand is tailed with
dCMP residues using terminal deoxynucleotidyl transferase ("TdT"). This produces a 3' tail of poly-C
residues. Then the synthesis of the second strand is primed by oligo-dG hybridized to the 3' tail. This
technique is said to help avoid losing portions of the 5' tail of the second cDNA strand which might occur if
the hairpin is cleaved with S1 nuclease, as in the Maniatis et al. protocol.

Cloning of cDNA 

Next, the double-stranded cDNA is inserted within a cloning vector which is used to transform 
compatible prokaryotic or eukaryotic host cells for replication of the vector. Thereafter, the transformants are
identified and plasmid DNA prepared therefrom. 

To carry out the present invention, various cloning vectors may be utilized. Although the preference is 
for a plasmid, the vector may be a bacteriophage or a cosmid. If cloning occurs in mammalian cells, viruses 
also can be used as vectors. 
If a plasmid is employed, it may be obtained from a natural source or artificially synthesized. The
particular plasmid chosen should be compatible with the contemplated transformation host, whether a
bacterium such as Escherichia coli ("E. coli"), yeast, or other unicellular microorganism. The plasmid should
have the proper origin of replication for the particular host cell to be employed. Also, the plasmid should
have a phenotypic property that will enable the transformed host cells to be readily identified and separated
from cells that do not undergo transformation. Such phenotypic characteristics can include genes providing
resistance to growth inhibiting substances, such as an antibiotic. Plasmids are commercially available that
encode genes conferring resistance to various antibiotics, including tetracycline, streptomycin, sulfa drugs,
penicillin, and ampicillin.

If E. coli is employed as the host cell, many possible cloning plasmids are commercially available which
may be used in conjunction with the present invention. A preferred plasmid for performing the present
invention is pBR322. This plasmid has been fully sequenced, as set forth in Sutcliffe, 43 Cold Spring Harbor
Symp. Quant. Biol. 77 (1979). A significant advantage of this plasmid is that it has 11 known unique
restriction sites, including the Pst I site in the ampicillin resistance gene. This feature is particularly useful for
cloning by the homopolymer tailing method. 

If a bacteriophage is used instead of a plasmid, such phages should have substantially the same
characteristics noted above for selection of plasmids. This includes the existence of a phenotypic marker 
and ligatable termini for attachment of foreign genes. 

Preferably, in the present invention, the double-stranded cDNA, having blunt ends, is inserted into a 
plasmid vector by homopolymeric tailing. As is well known in the art, in this technique, complementary
homopolymer tracts are added to the strands of the cDNA and to the plasmid DNA. The vector and
double-stranded cDNA are then joined together by hydrogen bonding between complementary homopolymeric tails
to form open, circular hybrid molecules capable of transforming host cells, such as E. coli. 

In one procedure for homopolymeric tailing, approximately 50 to 150 dA nucleotide residues are added
to the 3' ends of linearized plasmid DNA. A similar number of dT nucleotide residues are added to the 3'
ends of the double-stranded cDNA and then the cDNA and plasmid are joined together.

In an alternative and preferred method, dG tails are added to the 3' ends of the cloning vector that has 
been cleaved with an appropriate restriction enzyme. For instance, if the pBR322 plasmid is employed, the 
restriction enzyme Pst I may be used to digest the plasmid at the ampicillin resistance gene. Complementary
dC tails are added to the 3' ends of the double-stranded cDNA prior to insertion of the cDNA segment in
the plasmid with an appropriate annealing buffer.

It is to be understood that the double-stranded cDNA may be inserted within plasmid cloning vectors by
various other standard methods. One such alternative technique involves attaching synthesized nucleotide
linkers to the ends of the cDNA strands by using DNA ligase. The linkers are cleaved with a restriction
enzyme to generate cohesive termini for insertion within a plasmid cleaved with the same restriction
enzyme. Scheller et al., 196 Science 177-180 (1977); Maniatis et al., supra at 219.

The recombinant DNA plasmids, as prepared above, are used to transform host cells. Although the host 
may be any appropriate prokaryotic or eukaryotic cell, it is preferably a well-defined bacterium, such as E.
coli, or a yeast strain. Such hosts are readily transformed and capable of rapid growth in culture. Other
forms of bacteria, such as salmonella or pneumococcus, may be substituted for E. coli. In place of bacteria,
other unicellular microorganisms may be employed, for instance, fungi and algae. Whatever host is chosen,
it should not contain a restriction enzyme that would cleave the recombinant plasmid. 

If E. coli is employed as a host, preferable strains are MM294 and RR1. Protocols for transformation of
the MM294 host by a plasmid vector are well known, as set forth in Maniatis et al., supra at 255; and
Hanahan, 166 J. Mol. Biol. 557 (1983). Protocols for transformation of the RR1 host by a plasmid vector are
also well known, as set forth in Bolivar et al., 2 Gene 95 (1977) and Peacock et al., 655 Biochim. Biophys.
Acta 243 (1981). Other strains of E. coli which also could serve as suitable hosts include DH1 (ATCC No.
33849) and C600. These strains and the MM294 and RR1 strains are widely commercially available.

In transformation protocols, including those disclosed by Maniatis et al., supra, and Hanahan, supra,
only a small portion of the host cells are actually transformed, due to limited plasmid uptake by the cells.
The cells that have been transformed can be identified by placing the cell culture on agar plates containing
suitable growth medium and a phenotypic identifier, such as an antibiotic. Only those cells that have the
proper resistance gene (e.g., to the antibiotic) will survive. If the recombinant pBR322 plasmid is used to
transform E. coli strain MM294, transformed cells can be identified by using tetracycline as the phenotypic
identifier.

Preparation of Radiolabeled cDNA Screening Probe 



A radiolabeled DNA fragment composed of several hundred base-pairs ("bp") corresponding to a
majority of the nucleotide sequence of the gene coding for the murine GM-CSF species is used as a probe 
to screen the above-prepared human cDNA library. The probe is isolated from a murine cDNA library with a 
radiolabeled, synthetic oligonucleotide probe corresponding to a portion of the nucleotide sequence of 
murine GM-CSF. 

To isolate the cDNA probe for use in the screening procedure of the present invention, a murine cDNA
library is initially prepared from murine mRNA using the procedures set forth above. The mRNA is extracted
from a murine cell line known to produce GM-CSF species. Such cell lines may include various T and
macrophage cell lines, such as the T-lymphoma cell line LBRM-33 or clones thereof, which are
radiation-induced splenic lymphoma cell lines from the B10.BR mouse. This cell line and clones thereof are
available from a wide variety of commercial and private sources and have been extensively used by U.S. and
foreign researchers. Total RNA from the murine cells is extracted by standard methods as discussed above,
for instance, by the use of guanidinium thiocyanate together with 2-mercaptoethanol. Thereafter,
polyadenylated mRNA is separated from the extracted protein by chromatography on oligo (dT)-cellulose.
A library of double-stranded cDNA corresponding to the murine mRNA is constructed, as discussed
above, by employing reverse transcriptase to form an initial cDNA strand by using the mRNA as a template.
Next, the enzyme DNA polymerase I is used to synthesize the second cDNA strand, employing the first
strand as a template. The double-stranded cDNA is inserted within a cloning vector which is used to
transform compatible host cells for replication of the vector. Preferably, the vector is composed of a plasmid
having a number of unique restriction sites, such as the plasmid pBR322. The cDNA prepared from the
mRNA may be inserted within this plasmid by homopolymeric tailing, as described above. The recombinant
plasmids are used to transform a compatible host, such as a strain of E. coli. Of course, other appropriate 
hosts may be employed. The host cells that are transformed by the recombinant plasmid are identified with 
an appropriate standard phenotypic identifier, such as an antibiotic. 

A radiolabeled oligonucleotide is synthesized for use as a probe to screen the murine cDNA library. The
probe, derived from a portion of the antisense strand of the gene coding for murine GM-CSF, has the following
composition: 5'-TGATGGCCTCTACATGCTTCCAAGGCCGGGTAACAATTAT-3'. This probe complements
the 5' terminal portion of the sense strand shown in FIGURE 1, and has the advantage of being short
enough to be relatively easily synthesized, while being long enough to contain sufficient information to be
useful as a probe for the murine GM-CSF gene. It is to be understood, however, that the composition of the
probe may correspond to other portions of the murine GM-CSF gene without departing from the scope or
spirit of the present invention.
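
Because the probe is stated to be antisense, the sense-strand stretch it would base-pair with is its reverse complement. The following Python sketch, offered only as an illustration, derives that stretch from the 40-mer quoted above; the names `reverse_complement`, `PROBE`, and `target` are hypothetical, and FIGURE 1 is not reproduced here, so the result is not checked against it.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]

# 40-mer antisense probe as quoted in the text (5'->3').
PROBE = "TGATGGCCTCTACATGCTTCCAAGGCCGGGTAACAATTAT"

# Sense-strand stretch (5'->3') that the probe would base-pair with.
target = reverse_complement(PROBE)
print(len(PROBE), target)
```

Applying `reverse_complement` twice returns the original probe, which is a quick sanity check on the helper.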

The synthetic oligonucleotide probe may be readily chemically synthesized by well-known techniques, 
such as by phosphodiester or triester methods. The details of the triester synthesis technique are set forth, 
for example, in Sood et al., 4 Nucl. Acids Res. 2557 (1977); and Hirose et al., 28 Tet. Lett. 2449 (1978). After
synthesis, the oligonucleotide probe is labeled with T4 polynucleotide kinase and 32P-ATP. A standard
protocol for the labeling procedure is set forth in Maniatis et al., supra at 122. Advantageously, the
oligonucleotide probe can be synthesized with OH 5' termini, thereby avoiding the phosphatase procedure 
typically required. 

The murine cDNA library is screened with the synthetic radiolabeled probe as detailed infra at page 10
and in Examples 3 and 4. Plasmid DNA is then prepared from the particular positive colony identified by the
screening procedure. 

The murine plasmid DNA is sequenced by the chain-termination method discussed infra at page 11. 
From the sequencing results, as shown in FIGURE 1, the isolated plasmid DNA was found to include
substantially the entire coding region of the murine GM-CSF gene, plus additional sequences at the 5' end of
the gene and an additional codon, CCA (nucleotides 201-203), thereby confirming that the murine plasmid
DNA isolated from the murine cDNA library codes for murine GM-CSF. Additional differences between the
murine GM-CSF gene and the isolated plasmid DNA are shown in FIGURE 1; none of these differences
resulted in a change of the encoded amino acid.

Initially the entire length of the isolated murine plasmid DNA was chosen as a probe for screening the 
human cDNA library prepared above. A relatively large probe (in the range of 300-500 bp) usually
increases the likelihood that cDNA actually coding for human GM-CSF would be hybridized rather than
non-GM-CSF coding cDNA fragments. However, use of this entire murine plasmid DNA insert as a probe
was not successful; no hybridization occurred with cDNA fragments of the human library.

Subsequently, the size of the murine probe was reduced and the human cDNA library rescreened. The 
smaller probe consisted of a fragment extending from nucleotide Nos. 45 to 400 as indicated by the solid 
underline in FIGURE 1, and thus included only a small portion of the 3' non-coding region and none of the
5' non-coding region of the mouse plasmid DNA fragment. As detailed below, use of this probe was
successful in isolating the human GM-CSF gene from a cDNA library. It is to be understood that probes
corresponding to other portions of the nucleotide sequence of the murine plasmid DNA fragment may be 
employed without departing from the spirit or scope of the present invention. 

The murine cDNA probe is radiolabeled prior to being used for hybridizing to the human cDNA library 
pools. Due to the relatively large size of the probe, various labeling techniques may be employed; however,
preferably the probe is labeled by "nick translation." In this well-known technique, as discussed by Rigby et
al., 113 J. Mol. Biol. 237 (1977), and Maniatis et al., supra at 108, nicks are introduced at widely separated
sites in the DNA by very limited treatment with DNase I, thereby exposing a free 3'-OH group at each nick.
DNA polymerase I is employed to incorporate appropriate radiolabeled deoxynucleotide triphosphates
(32P-dNTPs) at the 3'-OH terminus and concurrently remove the nucleotide from the 5' side of the nick, causing
sequential movement of the nick along the DNA ("nick translation").

Screening of cDNA Library 

In the screening procedure of the present invention, the transformants are initially pooled into relatively
large groups each composed of approximately 100,000 transformants. The replicated plasmids are
extracted from the transformants using any one of several well-known techniques, such as by alkaline lysis.
Plasmid DNA is prepared by cleaving the extracted plasmids with Pst I. The resulting DNA segments are
fractionated by electrophoresis on agarose gels and then directly analyzed by Southern blotting as
described by Southern, 98 J. Mol. Biol. 503 (1975). The DNA fragments that bind to the nitrocellulose filter
in the Southern blotting procedure are hybridized with the labeled cDNA probe. The specific DNA fragments
that hybridize to the probe are identified by autoradiography.

Each putative pool of clones that discloses a strongly hybridizing band during autoradiography is
subdivided into groups of approximately 5,000 transformants, and then the above-described hybridizing
screen using the labeled murine cDNA probe is repeated. This process of subdividing putative pools of
clones and screening transformants is repeated until a desired pool size is obtained. A single transformant
that hybridizes to the labeled probe is then identified by the well-known colony hybridizing technique of
Grunstein and Hogness, 72 Proc. Natl. Acad. Sci. 3961 (1975). By this procedure, applicants have
discovered one such positive colony. Plasmid DNA, designated as pHG23, is prepared from this particular
colony.
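
The successive pooling and rescreening described above can be sketched in Python as follows. This is only an illustrative model: `narrow_by_pooling` and `pool_hybridizes` are hypothetical names, the predicate stands in for the wet-lab Southern blot of pooled plasmid DNA, and the pool sizes below 5,000 are assumptions, since the text specifies only "a desired pool size."

```python
def narrow_by_pooling(clones, pool_hybridizes, pool_sizes):
    """Successively split candidate clones into pools and keep only positive pools.

    pool_hybridizes is a stand-in for the wet-lab test (Southern blot of pooled
    plasmid DNA probed with the labeled cDNA); it takes a list of clones and
    returns True if the pool gives a hybridization signal.
    """
    candidates = list(clones)
    for size in pool_sizes:
        pools = [candidates[i:i + size] for i in range(0, len(candidates), size)]
        candidates = [clone for pool in pools if pool_hybridizes(pool) for clone in pool]
    return candidates

# Hypothetical example: one positive clone hidden among 100,000 transformants.
POSITIVE_CLONE = 73_421
library = range(100_000)
hits = narrow_by_pooling(
    library,
    pool_hybridizes=lambda pool: POSITIVE_CLONE in pool,
    # 5,000 comes from the text; the smaller sizes are illustrative assumptions.
    pool_sizes=[5_000, 250, 10],
)
print(hits)  # the final small pool, to be resolved by colony hybridization
```

In this model each round requires one hybridization test per pool, so the number of blots grows with the number of pools per round rather than with the size of the library.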

Characterization of Screened cDNA 

The plasmid DNA prepared above is sequenced using standard chain-termination methods. This 
technique of nucleotide sequencing was originated by Sanger et al., 74 Proc. Natl. Acad. Sci. (USA) 5463
(1977). See U.S. Patent No. 4,322,499. Methods for chain-termination sequence determination are set forth
in the Amersham Handbook entitled M13 Cloning and Sequencing, Blenheim Crescent, London (1983)
(hereinafter "Amersham Handbook"); Messing, 2 Recombinant DNA Technical Bulletin, NIH Publication No.
79-99, 2, 43-48 (1979); Norrander et al., 26 Gene 101 (1983); Cerretti et al., 11 Nucl. Acids Res. 2599
(1983); and Biggin et al., 80 Proc. Natl. Acad. Sci. (USA) 3963 (1983). M13 filamentous phage is employed
as a vector to clone the DNA sequence of interest. These phage vectors provide single-stranded DNA
templates which are readily sequenced by the chain-termination method, which involves priming a
single-stranded template molecule with a short primer strand having a free 3' hydroxyl group and then using DNA
polymerase (Klenow fragment) to copy the template strand in a chain extension reaction using all four
deoxyribonucleotide triphosphates, i.e., dATP, dCTP, dGTP, and dTTP (collectively referred to as
"dNTPs"), with one of the dNTPs being radiolabeled. In the synthesis reaction, a nucleotide specific chain
terminator lacking a 3'-hydroxyl terminus, for instance, a 2', 3' dideoxynucleotide triphosphate ("ddNTP"), is
used to produce a series of different length chain extensions. The terminator has a normal 5' terminus so
that it can be incorporated into a growing DNA chain, but lacks a 3' hydroxyl terminus. Once the terminator
has been integrated into a DNA chain, no further deoxynucleotide triphosphates can be added so that
growth of the chain stops. Four separate synthesizing reactions are carried out, each having a ddNTP of
one of the four nucleotide dNTPs, i.e., dATP, dCTP, dGTP and dTTP. One of the normal dNTPs is
radiolabeled so that the synthesized strands, after having been sorted by size on a polyacrylamide gel, can
be autoradiographed. The chain extensions from the four reactions are placed side by side in separate gel
lanes so that the pattern of the fragments from the autoradiography corresponds to the nucleic acid 
sequence of the cloned DNA. 
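
The way the four-lane pattern encodes the sequence can be shown with a small idealized model in Python. This is a sketch under simplifying assumptions: every position of the newly synthesized strand is taken to terminate in exactly one lane, the primer is omitted, and the template `GATTACA` and all function names are hypothetical rather than drawn from the figures.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

def extend_primer(template):
    """Strand synthesized 5'->3' against a template written 5'->3' (idealized, primer omitted)."""
    return "".join(COMPLEMENT[base] for base in reversed(template))

def run_lanes(new_strand):
    """For each ddNTP reaction, list the lengths of the terminated fragments."""
    lanes = {base: [] for base in "ACGT"}
    for length, base in enumerate(new_strand, start=1):
        lanes[base].append(length)  # a chain ending in dd<base> stops at this length
    return lanes

def read_gel(lanes):
    """Read the four lanes from shortest to longest fragment to recover the sequence."""
    base_at_length = {n: base for base, lengths in lanes.items() for n in lengths}
    return "".join(base_at_length[n] for n in sorted(base_at_length))

template = "GATTACA"  # hypothetical template, not from the figures
new_strand = extend_primer(template)
lanes = run_lanes(new_strand)
print(lanes)
print(read_gel(lanes))
```

Reading the lanes in order of increasing fragment length recovers the synthesized strand, which is the complement of the template rather than the template itself.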

FIGURE 2 illustrates the nucleotide sequence of the human GM-CSF gene contained in the pHG23 
plasmid DNA prepared above. The corresponding amino acid composition of the coding region of the gene
is also illustrated in FIGURE 2, beginning from the Ala residue, No. 1 (nucleotide No. 14) and extending to
the Glu residue, No. 127 (nucleotide No. 394).
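
The residue and nucleotide numbers quoted above can be cross-checked with simple codon arithmetic. The Python sketch below assumes that nucleotide No. 14 is the first base of the Ala codon and that nucleotide No. 394 is the last base of the Glu codon; that reading is an interpretation of the text, and `coding_span` is a hypothetical helper.

```python
def coding_span(first_nucleotide, residue_count):
    """Return (first, last) nucleotide numbers of a coding region of residue_count codons."""
    last_nucleotide = first_nucleotide + 3 * residue_count - 1
    return first_nucleotide, last_nucleotide

# Values from the text: residue No. 1 at nucleotide No. 14, 127 residues in total.
first, last = coding_span(14, 127)
print(first, last, last - first + 1)
```

Under that assumption, 127 codons starting at nucleotide 14 span 381 nucleotides and end at nucleotide 394, which agrees with the numbering given for FIGURE 2.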

In preparation for the sequencing procedures, the plasmid DNA containing the DNA insert is subcloned
into M13 phage vectors to form single-stranded DNA templates. A universal primer is used to sequence the
sense and antisense strands. Rather than relying on the sequencing results obtained from sequencing the
entire length of the fragments with a single chain-termination procedure, an additional synthetically
produced primer is used to initiate the chain-termination procedure from an intermediate location along the
length of the subcloned DNA fragment. The composition of the synthetically produced primer was based on
the sequence information obtained using the universal primer. By this process, both strands of the
subcloned DNA fragment are sequenced in overlapping fashion, thereby serving to redundantly confirm the
sequences.

It is to be understood that rather than employing the chain-termination technique outlined above, other 
known methods may be utilized to sequence cloned human cDNA inserts without departing from the spirit 
or scope of the present invention. For instance, the chemical degradation method of Maxam and Gilbert as 
set forth in 74 Proc. Nat'l Acad. Sci. (USA) 560 (1977) can be used. 

Expression of Functional GM-CSF from cDNA Clone 



To determine whether the cDNA coding region of the GM-CSF gene as contained in pHG23 would
encode functional GM-CSF, the gene is expressed in host cells and the expression product is then tested for
its ability to stimulate the growth of bone marrow colonies in agar. A cDNA fragment of substantially the
entire coding region of the GM-CSF gene shown in FIGURE 2, from the Sfa NI to Nco I fragment, is inserted
into an expression vector, FIGURE 3, designed to direct synthesis and secretion of the mature form of
GM-CSF from yeast host cells. The expression vector, designated as pYαfGM-2, contains sequences derived
from plasmid pBR322 (thick line) containing an origin of replication and the ampicillin resistance gene
(Ap^r). The expression vector also includes sequences from yeast (thin line in FIGURE 3), including the
tryptophan-1 gene (Trp-1) as a selectable marker and the 2 µ yeast origin of replication. The expression
vector further includes the yeast pre-pro-α mating factor ("α-factor") as an efficient promoter together with
leader sequences to direct the synthesis and secretion of GM-CSF in yeast hosts, followed immediately by the
sequence for the coding region of GM-CSF shown in FIGURE 2. The structure of the α-factor gene is
discussed in Kurjan and Herskowitz, 30 Cell 933-943 (October 1982).

The pYαfGM-2 expression plasmid is transformed into an appropriate strain of Saccharomyces
cerevisiae (S. cerevisiae). Preferable strains include yeast strains Nos. 79, X2181-1B, DBY746, YNN282, and
20B-12. These strains are all α, Trp 1 for compatibility with the α-factor promoter and for selection of Trp+
transformants. These strains are all widely available; for instance, strain 79 is available from the Yeast
Genetic Stock Center, Department of Biophysics and Medical Physics, University of California, Berkeley,
California 94702. The culture supernatants are assayed for biological activity through their ability to direct
the formation of mixed, granulocytic and macrophage-type colonies from human bone marrow cells. As a
control, plasmid pYαf, of the same construction as pYαfGM-2 but lacking the GM-CSF sequences, was
also transformed into a yeast host and the culture supernatant tested for biological activity. The pYαfGM-2
supernatant was found to exhibit high levels of GM-CSF activity in the bone marrow colony
assay (1.25 x 10^ CFU-C/ml) whereas no activity was detected from the supernatant derived from the pYαf
control plasmid.

Analysis of mRNA

Since the availability of cells that synthesize GM-CSF is limited, the expression of GM-CSF mRNA from
a number of cell types which have been reported previously to express other CSF was investigated.
Northern blots of RNA from these cells were analyzed by hybridization with an RNA probe derived from
pHG23. As shown in FIGURE 4, the probe strongly hybridized to single bands of RNA derived from
peripheral blood T-cells activated with PMA and Con A and from HUT-102 cells (lanes 3 and 1). A low level
of hybridization occurred in RNA from a human bladder tumor cell line (lane 6). Also, no hybridization
occurred in RNA from unstimulated T-cells, lipopolysaccharide-stimulated macrophages and a human
pancreatic tumor cell line (lanes 2, 4 and 5). These results are consistent with the level of biological activity
found to be associated with GM-CSF derived from activated peripheral blood T-cells and from HUT-102
cells.

Analysis of Human Genomic Sequences 

The number of GM-CSF related genes in human genomic DNA was investigated by hybridizing a
32P-labeled human GM-CSF probe to Southern blots of human genomic DNA fragments. The fragments were
prepared by digesting human genomic DNA with a number of different restriction enzymes expected to cut
the DNA relatively infrequently. As shown in FIGURE 5, digestion of the human genomic DNA with Hind III,
Eco RI, or Pst I resulted in single bands, whereas digestion with Bgl II resulted in two hybridization bands.
Based on these results, it appears that GM-CSF exists as a single copy gene in human genomic DNA.

The processes and products of the present invention are further illustrated by the following examples. 

EXAMPLE 1 

Preparation of Polyadenylated mRNA From Lymphoma T-Cells 



HUT-102 cells at a concentration of approximately 2x10^ cells per ml were cultured in 100-500 ml
volumes in Roswell Park Memorial Institute ("RPMI")-1640 medium supplemented with 10% (v/v) fetal calf
serum ("FCS"), 2 mM glutamine, 100 U/ml penicillin and 100 micrograms per milliliter ("ug/ml")
streptomycin. The cells were cultured for approximately 3-5 days in a humidified atmosphere of 5% CO2 in air. After
this period of time, viable cells were harvested by centrifugation.

Total RNA was extracted from the HUT-102 cells by the standard method described by Chirgwin et al.,
supra. In this procedure guanidinium thiocyanate was used to denature the cellular protein, including the
RNase, at a rate that exceeds the rate of RNA hydrolysis by RNase. The mRNA was removed from the
cellular protein by ultracentrifugation through a dense cushion of cesium chloride.

Thereafter, polyadenylated mRNA was separated from the extracted protein on an oligo (dT)-cellulose
chromatography column using the standard method disclosed by Maniatis et al., supra at 197. Briefly, the
column was prepared with application buffer (20 mM Tris-Cl (pH 7.6), 0.5 M NaCl, 1 mM ethylene diamine
tetra acetate ("EDTA") and 0.1% sodium dodecyl sulfate ("SDS")). The protein pellet was dissolved in
water and application buffer and then loaded onto the column. The nonadsorbed material was removed from
the column by initial washings with application buffer followed by additional washings with application buffer
containing 0.1 M NaCl. The retained polyadenylated mRNA was eluted with buffers of reduced ionic
strength composed of 10 mM Tris-Cl (pH 7.5), 1 mM EDTA and 0.05% SDS. The eluted polyadenylated
mRNA was precipitated at -20° C with 1/10 volume sodium acetate (3 M, pH 5.2) and 2.2 volumes of ethanol.
After elution of the polyadenylated mRNA from the oligo (dT)-cellulose column, the integrity of the
polyadenylated mRNA was confirmed by electrophoresis through agarose gels, by the standard method set
forth in Maniatis et al., supra at 199.
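
The precipitation step described above scales with the volume of eluate. The following sketch is illustrative only and is not part of the original procedure. It assumes that both the 1/10 volume of sodium acetate and the 2.2 volumes of ethanol are measured relative to the starting eluate volume; the function name and the 5 ml example volume are hypothetical.

```python
def precipitation_volumes(eluate_ml):
    """Reagent volumes for ethanol precipitation of an RNA eluate.

    Assumption: both ratios are taken relative to the eluate volume.
    Returns (sodium acetate ml, ethanol ml, final acetate molarity).
    """
    acetate_ml = eluate_ml / 10.0    # 1/10 volume of 3 M sodium acetate (pH 5.2)
    ethanol_ml = eluate_ml * 2.2     # 2.2 volumes of ethanol
    total_ml = eluate_ml + acetate_ml + ethanol_ml
    # Final sodium acetate molarity once the 3 M stock is diluted into the mix.
    acetate_final_molar = 3.0 * acetate_ml / total_ml
    return acetate_ml, ethanol_ml, acetate_final_molar


# Hypothetical 5 ml eluate.
acetate, ethanol, molarity = precipitation_volumes(5.0)
print(f"sodium acetate: {acetate:.2f} ml, ethanol: {ethanol:.2f} ml, "
      f"final acetate: {molarity:.3f} M")
```

Under the stated assumption the final sodium acetate concentration does not depend on the eluate volume, since every component scales by the same factor.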

EXAMPLE 1A

Preparation of Polyadenylated mRNA From Peripheral Blood T-Lymphoma Cells 

Peripheral blood T-lymphocyte cells (mixture from Portland, Oregon Red Cross) at a concentration of
approximately 2 x 10^ cells per ml were cultured in 100-500 ml volumes in RPMI-1640 medium
supplemented with 10% (v/v) FCS, 2 mM glutamine, 100 U/ml penicillin and 100 ug/ml streptomycin,
together with 20 ug/ml concanavalin A ("Con A") (Pharmacia Fine Chemicals, Piscataway, NJ) and 10 ug/ml
phorbol myristate acetate ("PMA") (Sigma Chemical Co., St. Louis, MO). The cells were cultured for
approximately 20 hours in a humidified atmosphere of 5% CO2 in air. After this period of time, viable cells
were harvested by centrifugation. Thereafter, total RNA was extracted from the peripheral blood T-cells and
polyadenylated mRNA prepared from the extracted protein as described in Example 1, supra.

EXAMPLE 2

Construction of cDNA Library 

A library of double-stranded cDNA corresponding to the mRNA was prepared from the purified total
mRNA in Examples 1 and 1A by employing the standard procedure detailed by Maniatis et al., supra at
229. Oligo-dT was hybridized to the polyadenylated tail of the mRNA to serve as the primer for the reverse
transcription of the first cDNA strand. The enzyme avian myeloblastosis virus ("AMV") reverse transcriptase
synthesized the first DNA strand by using the mRNA as a template. This procedure resulted in a hairpin
loop being formed at the 3' end of the initial cDNA strand that serves as an integral primer for the second
cDNA strand. After the mRNA strand had been degraded with NaOH, the second cDNA strand was
synthesized with DNA polymerase I. The hairpin was then removed with nuclease S1 to produce
double-stranded cDNA molecules.

The double-stranded cDNA was fractionated into size classes by Sephacryl S-400 (Pharmacia Fine
Chemicals, Piscataway, N.J.) column chromatography and monitored by analysis using alkaline agarose
electrophoresis employing end-labeled fragments of pBR322 DNA as molecular-weight markers. cDNA
having a length of less than 500 bp was discarded to avoid needless cloning of these undersized cDNA
fractions.

The double-stranded cDNA fractions, as prepared above, were inserted into the Pst I site of the pBR322
plasmid by the standard method contained in Maniatis et al., supra, beginning at 239. In this procedure, the
double-stranded cDNA was tailed with poly (dC) at its 3' ends. The plasmid pBR322 (Pharmacia Fine
Chemicals) was digested with Pst I endonuclease and then tailed with poly (dG) at its 3' ends. The tailed
plasmid DNA and the tailed cDNA were annealed in annealing buffer (0.1 M NaCl, 10 mM Tris-Cl (pH 7.8)
and 10 mM EDTA) to form recombinant plasmids. All restriction enzymes described herein are
commercially available from New England Biolabs, Beverly, Massachusetts.

The recombinant plasmids were transformed into E. coli strain MM294 by using the standard procedure
of Hanahan, supra, in which the E. coli cells were prepared by growth in elevated levels of Mg2+. The
transformation hosts were plated and then transformants were identified by use of tetracycline as a
phenotypic identifier. By this technique, applicants obtained approximately 2 x 10^ independent
transformants.

EXAMPLE 3 

Preparation of Murine GM-CSF cDNA Screening Probe 



Total RNA was extracted from LBRM-33-5A4 cells and polyadenylated mRNA was separated therefrom
on an oligo (dT)-cellulose chromatography column, using the protocols set forth in Examples 1 and 1A. The
LBRM-33-5A4 cell line is available from the American Type Culture Collection, 12301 Park Lawn Drive,
Rockville, MD 20852, U.S.A., No. ATCC-CRL-8080. The integrity of the resulting polyadenylated mRNA was
confirmed by agarose gel electrophoresis. A library of double-stranded cDNA corresponding to the murine
mRNA was prepared by the method set forth above in Example 2. The resulting double-stranded cDNA
fractions of sizes greater than 500 bp were inserted into the Pst I site of the pBR322 plasmid by the
homopolymeric tailing method set forth in Example 2. The recombinant plasmids were transformed into E.
coli strain MM294 and then the transformants were identified by use of tetracycline as a phenotypic
identifier. By this process, applicants identified approximately 6x10* independent transformants.

A synthetic oligonucleotide probe was chemically synthesized by the standard triester method, as detailed
by Sood et al., supra, and Hirose et al., supra, and then radiolabeled with 32P for use in screening the
murine cDNA library. The probe was composed of the following nucleotide sequence:
5'-TGATGGCCTCTACATGCTTCCAAGGCCGGGTAACAATTAT-3'. To facilitate labeling, the 5' ends of the
oligonucleotides are synthesized with OH termini, thereby eliminating the phosphatase treatment which
typically must be employed when labeling DNA fragments. The labeling protocol included adding one
microliter ("ul") of the synthetic oligonucleotides to 16 ul of 32P-ATP (7000 Ci/mM), 1 ul (10 U) of T4
polynucleotide kinase and 2 ul of 10 x kinase buffer I (0.5 M Tris-Cl (pH 7.6), 0.1 M MgCl2, 50 mM
dithiothreitol, 1 mM spermidine and 1 mM EDTA). The reaction was carried out at 37° C for thirty minutes,
and thereafter the synthesized oligonucleotides were extracted with phenol/chloroform. The labeled probes
were separated from unlabeled oligonucleotides by chromatography on Sephadex G-50 columns
(Pharmacia Fine Chemicals).
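
The component volumes of the labeling reaction described above add up to 20 ul, so the 10 x kinase buffer is diluted to 1 x. A minimal sketch of that arithmetic follows. It is offered as an illustration rather than as part of the protocol, and it assumes that the magnesium entry of the buffer is 0.1 M MgCl2; the dictionary and function names are hypothetical.

```python
# Component volumes (microliters) as listed in the labeling protocol.
REACTION_UL = {
    "oligonucleotide": 1.0,
    "32P-ATP": 16.0,
    "T4 polynucleotide kinase": 1.0,
    "10x kinase buffer": 2.0,
}

# Composition of the 10x kinase buffer in mM.
# Assumption: the magnesium entry is read as 0.1 M MgCl2.
BUFFER_10X_MM = {
    "Tris-Cl": 500.0,
    "MgCl2": 100.0,
    "dithiothreitol": 50.0,
    "spermidine": 1.0,
    "EDTA": 1.0,
}


def final_buffer_concentrations(reaction_ul, buffer_mm):
    """Return the total reaction volume and the final buffer concentrations (mM)."""
    total_ul = sum(reaction_ul.values())
    dilution = reaction_ul["10x kinase buffer"] / total_ul
    final_mm = {name: conc * dilution for name, conc in buffer_mm.items()}
    return total_ul, final_mm


total_ul, final_mm = final_buffer_concentrations(REACTION_UL, BUFFER_10X_MM)
print(f"total volume: {total_ul:.0f} ul")
for name, conc in final_mm.items():
    print(f"{name}: {conc:g} mM")
```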

To facilitate initial screening of the murine cDNA library, the transformed bacterial cultures were
grouped into pools, each having approximately 3000 different clones. Plasmid DNA was removed from
samples of the host bacteria by the standard alkaline lysis method detailed by Ish-Horowicz and Burke, 9 Nucl.
Acids Res. 2989 (1981). The isolated plasmids were digested to completion with Pvu II and Hind III by
standard procedures. Next, the plasmid digests were fractionated by electrophoresis through 0.8% agarose
gel and then blotted onto nitrocellulose filter by the standard method of Southern, supra. The DNA that
bound to the nitrocellulose filter was hybridized with the labeled synthetic oligonucleotide probe using the
procedure detailed in Example 4, infra. The putative pool(s) of clones from which hybridizing bands of DNA
were obtained was screened by direct colony hybridization with the radiolabeled synthetic probe, and a
single positive colony was identified.

Plasmid DNA was prepared from the identified positive colony by the procedures set forth above and
then sequenced as discussed in Example 5, infra. The nucleotide sequence of the isolated murine plasmid
DNA fragment, as shown in FIGURE 1, was found to include substantially the nucleotide sequence of the
reading frame of murine GM-CSF gene plus additional nucleotides at the 5' terminus of the gene and one
additional codon at an intermediate location along the coding region of the gene (nucleotide Nos. 201-203),
thus confirming that the purified murine plasmid DNA codes for GM-CSF. As shown in FIGURE 1, several
additional nonsignificant differences existed in the third nucleotides of several codons, which differences did
not cause a change in the encoded amino acid.

The 356 bp fragment of the murine GM-CSF cDNA clone defined by the solid underline in FIGURE 1
(nucleotide No. 45 to nucleotide No. 400) was selected as a probe for screening the human plasmid DNA
prepared in Example 2 above. The probe fragment was removed from the murine cDNA clone by double
digestion with the restriction enzymes Pst I and Hae III followed by agarose gel electrophoresis.

The cDNA nucleotide probe was radiolabeled by nick translation by the standard procedure set forth in
Maniatis et al., supra at 108 and discussed above at page 9. By this procedure, the probe was labeled to a
specific activity of approximately 5 x 10^ CPM/ug DNA. Prior to use in screening protocols, the labeled
probe was denatured by boiling in water at 100° C for ten minutes followed by chilling on ice.

EXAMPLE 4 

Screening of cDNA Library 

To facilitate initial screening of the cDNA library prepared in Example 2 above, the transformed bacteria
cultures were grouped into pools each having approximately 100,000 different clones. Plasmid DNA was
removed from samples of the host bacteria by the standard alkaline lysis method detailed by Ish-Horowicz and
Burke, 9 Nucl. Acids Res. 2989 (1981). The isolated plasmids were cleaved with Pst I and then fractionated
by electrophoresis through 1.0% agarose gel with markers of appropriate size. The agarose gel was blotted
onto nitrocellulose filter using the method described by Southern, supra. After the transfer process, the filter
was air dried and baked for two hours at approximately 80° C under a vacuum to bind the DNA fragments
to the nitrocellulose.

The bound DNA was next hybridized with the labeled cDNA probe. Briefly, the baked nitrocellulose was
incubated at 55° C for 2-4 hours in prehybridization buffer composed of 6 x SSC, 0.5% NP40 detergent,
0.1% sarcosyl, 5 x Denhardt's solution (0.02% Ficoll, 0.02% polyvinyl pyrrolidone, 0.02% BSA) and 100
ug/ml denatured salmon sperm DNA (Sigma Type III, sodium salt). The filter was then incubated overnight
at 55° C with the 32P-labeled cDNA probe (10^ cpm/ml) (from Example 3) in hybridizing solution as above.
After overnight hybridization, the filter was washed extensively with 6 x SSC at room temperature and then
for 1 hour at 42° C and then for 1.5 hours at 55° C with 6 x SSC. After air drying, the filter was subjected to
autoradiography at -70° C.

From the autoradiography, applicants found a number of strongly hybridizing bands. One putative pool
of clones, whose plasmid DNA produced a strongly hybridizing band, was
subdivided into pools of approximately 7,000 transformants and the hybridization screening procedure
repeated. The putative subpool from which a strongly hybridizing band of DNA was seen was then plated.
The resulting colonies were probed with the radiolabeled cDNA nucleotide probe by the well-known
methods of Grunstein and Hogness, supra, using the hybridizing conditions described above. By this
process, a single positive host colony was identified.
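
The screening strategy of this Example narrows the library in stages: pools are tested as a whole, a positive pool is subdivided into smaller pools, and only the final subpool is examined colony by colony. The following sketch models that logic in simplified form. It is a hypothetical illustration, not the laboratory procedure: the function names are invented, each hybridization is reduced to a yes/no test, and only the first positive pool at each stage is followed.

```python
def split_into_pools(clones, pool_size):
    """Divide a list of clones into consecutive pools of at most pool_size."""
    return [clones[i:i + pool_size] for i in range(0, len(clones), pool_size)]


def screen_by_pooling(clones, is_positive, pool_sizes):
    """Locate a positive clone by successive pooling.

    clones      -- list of clone identifiers (hypothetical)
    is_positive -- function returning True for a hybridizing clone
    pool_sizes  -- decreasing pool sizes, e.g. (100_000, 7_000)
    """
    candidates = list(clones)
    for size in pool_sizes:
        pools = split_into_pools(candidates, size)
        # A pool is scored positive if any clone in it hybridizes,
        # modelling one blot of pooled plasmid DNA.
        positive_pools = [p for p in pools if any(is_positive(c) for c in p)]
        if not positive_pools:
            return None
        candidates = positive_pools[0]
    # Final stage: colony hybridization on the plated subpool.
    for clone in candidates:
        if is_positive(clone):
            return clone
    return None


# Small hypothetical library with one positive clone.
library = list(range(1000))
found = screen_by_pooling(library, lambda c: c == 537, (100, 7))
print(found)
```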
EXAMPLE 5

Characterization of Screened cDNA 

Plasmid, designated as pHG23, was prepared with cDNA from the identified positive colony by the
procedures set forth in Example 4. Samples of the host plasmid transformed into E. coli are on deposit with
the ATCC under Accession No. 39900. The cDNA insert prepared from the plasmid DNA removed from
the positive host colony was sequenced by standard chain-termination protocol essentially as described in
the Amersham Handbook, supra, with the variations set forth below. The cDNA insert was digested with Pst
I and/or Rsa I and then subcloned into strains mp18 and mp19 of the M13 single-stranded filamentous
phage vector (Amersham, Arlington Heights, IL). The mp18 and mp19 phage vectors, as set forth in
Norrander et al., supra, contain the following unique cloning sites: Hind III; Sph I; Pst I; Sal I; Acc I; Hinc II;
Xba I; BamHI; Xma I; Sma I; Kpn I; Sst I; and, EcoRI. The composition of the mp18 and mp19 vectors is
identical, with the exception that the order of the above-identified restriction sites is reversed in the mp19
vector so that both strands of the cDNA insert may be conveniently sequenced with the two vectors. The
mp18 and mp19 vectors, with a corresponding strand of the cDNA inserted therein, were used to transform
E. coli JM107 of the strain K12 (Bethesda Research Laboratories, Bethesda, MD) to produce replicate
single-stranded DNA templates containing single-stranded inserts of the sense and antisense strands.

The synthetic universal primer: 5'-CCCAGTCACGACGTT-3' (P-L Biochemicals, Milwaukee, WI), was
annealed to the single-strand DNA templates and used to prime DNA synthesis as described above at page
10. Thereafter, the extension fragments were size-separated by gel electrophoresis and autoradiographed,
from which the nucleotide sequences of the fragments were deduced.

An additional primer of the composition 5'-GGCTGGCCATCATGGTC-3' was employed to prime
synthesis from an intermediate location along the sense strands of the cDNA clones. The composition of the
additional primer strand was established by sequencing of the cDNA subclones using the universal primer.
By the above "walk down" method, the strands of the cDNA clones were sequenced in an overlapping,
redundant manner thereby confirming their nucleotide sequences. It is to be understood that other synthetic
primers could have been employed to initiate chain extensions from other locations along the strands of the
cDNA clone, without departing from the scope of the present invention.

Deoxyadenosine 5' (alpha-[35S] thio) triphosphate (hereinafter "dATP [alpha-35S]") was used as the
radioactive label in the dideoxy sequencing reactions. Also, rather than using the gel set forth at page 36 of
the Amersham Handbook, a 6% polyacrylamide gel was employed (6% polyacrylamide gel, 0.4 mm thick,
containing 7 M urea, 100 mM Tris borate (pH 8.1), and 2 mM EDTA).

As noted above, the nucleotide sequence of the cDNA is illustrated in FIGURE 2. The coding region of
the human GM-CSF gene extends from nucleotide No. 14 (Ala residue) to nucleotide No. 394 (Glu residue),
as shown in FIGURE 2. The corresponding amino acids, as determined by the nucleotide sequence, are set
forth below the codons.
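
The stated boundaries of the coding region can be checked for consistency with a whole number of codons. The short sketch below does this using the nucleotide numbers given above; it assumes, for illustration, that both boundary nucleotides are included in the span, and the function name is hypothetical.

```python
def codon_count(first_nt, last_nt):
    """Number of complete codons in a span given by 1-based, inclusive coordinates."""
    length = last_nt - first_nt + 1
    if length % 3 != 0:
        raise ValueError(f"span of {length} nt is not a whole number of codons")
    return length // 3


# Nucleotide No. 14 (Ala) through nucleotide No. 394 (Glu), both inclusive.
print(codon_count(14, 394))
```

Under that assumption the span is 381 nucleotides, which corresponds to 127 codons.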

EXAMPLE 6 

Expression of Mature GM-CSF 

Substantially the entire coding region and a portion of the 3' flanking region of the GM-CSF gene were
removed from the cDNA clone of FIGURE 2 and employed to form a recombinant expression plasmid,
designated as pYαfGM-2, to direct GM-CSF expression in yeast host cells. The pYαfGM-2 expression
plasmid is on deposit with the ATCC under Accession No. 53157. As shown in FIGURE 3, pYαfGM-2
includes an origin of replication and an ampicillin resistance (Ap^r) gene from plasmid pBR322 (thick line portion). The
expression plasmid also includes the yeast 2 μ circle origin of replication and a Trp 1 gene for selection of
transformed yeast hosts (Trp- [Trp-auxotrophs], thin line portion in FIGURE 3). The expression plasmid
further includes the α-factor promoter and leader sequences used to direct transcription and secretion of
GM-CSF (solid box portion). The GM-CSF sequences (hatched box portion) are fused to the α-factor
sequences with a synthetic oligonucleotide as discussed more fully below.

Substantially the entire coding region of the GM-CSF gene, from the Sfa NI to the Nco I site, was
removed from the pHG23 clones by use of Sfa NI and Nco I restriction enzymes in a standard protocol, for
instance as set forth in Maniatis et al., supra at 104. The GM-CSF gene segment was cleaved from the
pHG23 clone at the Sfa NI site, which is located two nucleotides downstream from the 5' terminus of the
region coding for the mature protein (nucleotide No. 14), since no restriction site was found to correspond
precisely to nucleotide No. 14. An oligonucleotide was chemically synthesized to add back the 5' terminal
portion of the coding region of the mature GM-CSF gene and also to add a second α-factor processing site
to obtain complete processing of the signal for secretion of the mature form of GM-CSF. The composition of
the oligonucleotide, as shown in Table I below, and in FIGURE 3, includes a Hind III cohesive 5' terminus,
followed by a cathepsin B-like maturation site composed of the sequence: TCT TTG GAT AAA AGA, and a
Sfa NI cohesive 3' terminus coding for the first two amino acid residues of the mature GM-CSF protein.
Although the oligonucleotide shown in Table I is chemically synthesized by the triester technique as detailed by
Sood et al., supra and Hirose et al., supra, it is to be understood that the oligonucleotide can be prepared
by other methods, such as by the phosphodiester method.

TABLE I

5'  A GCT TCT TTG GAT AAA AGA GC          3'
3'        AGA AAC CTA TTT TCT CGT GGG     5'
          Ser Leu Asp Lys Arg Ala Pro
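
The two strands of the synthetic oligonucleotide can be checked against each other and against the amino acids listed beneath them. The sketch below is illustrative and rests on stated assumptions: the upper strand is read as 5' A GCT TCT TTG GAT AAA AGA GC 3' and the lower strand, written 3' to 5', as AGA AAC CTA TTT TCT CGT GGG, with the characters printed as "O" in the table taken to be C. The codon dictionary holds only the standard genetic code entries needed here, and all variable names are hypothetical.

```python
# Standard genetic code entries for the codons that occur in this oligonucleotide.
CODONS = {
    "TCT": "Ser", "TTG": "Leu", "GAT": "Asp", "AAA": "Lys",
    "AGA": "Arg", "GCA": "Ala", "CCC": "Pro",
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def complement(seq):
    """Base-by-base complement, without reversing the sequence."""
    return seq.translate(COMPLEMENT)


top = "AGCTTCTTTGGATAAAAGAGC"      # 5'->3', starts with the 4-nt Hind III overhang
bottom = "AGAAACCTATTTTCTCGTGGG"   # written 3'->5', ends with the 4-nt Sfa NI overhang

# The duplex region excludes the single-stranded overhang at each end.
duplex_top = top[4:]
duplex_bottom = bottom[:-4]
strands_pair = complement(duplex_top) == duplex_bottom

# Coding sequence: duplex region plus the complement of the lower-strand overhang.
coding = duplex_top + complement(bottom[-4:])
residues = [CODONS[coding[i:i + 3]] for i in range(0, len(coding), 3)]

print(strands_pair)
print(" ".join(residues))
```

Under these assumed readings, the duplex portions are complementary and the coding strand translates to Ser Leu Asp Lys Arg Ala Pro, in agreement with the residues shown beneath the table.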

It is to be understood that other standard recombinant DNA techniques could be used to generate the
same expression vector, and that the construction detailed above is representative of various strategies that
could be used to prepare a GM-CSF cDNA fragment for insertion into the pYαfGM-2 expression vector.

The pYαfGM-2 was transformed into yeast strain 79 (α, Trp 1-1, Leu 2-1) of S. cerevisiae for selection
of Trp+ transformants by standard techniques. Prior to transformation, strain 79 was grown in culture in
YP-glucose medium to a density of 2x10^ cells/ml. Cells were harvested by centrifugation at 1000 x g for
5 minutes at 22° C, and then the resulting pellet was washed with sterile, distilled water.

The yeast cells were then concentrated by resuspending in 1/10 vol. of SED (1 M sorbitol, 25 mM
EDTA [pH 8.0], and 50 mM dithiothreitol) and incubating for 10 minutes at 30° C. The cell-buffer mixture
was then centrifuged for 5 minutes at 300 x g. The pellet was washed once with 1/10 vol. of 1 M sorbitol
and the cells resuspended in 20 milliliters of SCE (1 M sorbitol, 0.1 M sodium citrate [pH 5.8], 0.01 M
EDTA). Glusulase, to break down the cell walls, in an amount of 10"^ vol. was added to the solution and
then the solution incubated at 30° C for 30 minutes with occasional gentle shaking. The presence of
spheroplasts was assayed by diluting 10 microliters of the yeast cells into a drop of 5% SDS (wt./vol.) on a
microscope slide to observe for "ghosts" at 400 x phase contrast. The cell mixture was then centrifuged at
300 x g for 3 minutes. The resulting pellet was twice washed with 1/10 vol. of 1 M sorbitol. The pellet was
then once washed in CaS (1 M sorbitol, 10 mM CaCl2).

The yeast spheroplasts were then transformed with the previously prepared plasmid vector in a
procedure adapted from Beggs, supra. The pelleted spheroplasts were suspended in 1/200 vol. of CaS and
then divided into 100 microliter aliquots in 1.5 ml Eppendorf tubes. Then, from 1 to 10 ul of the plasmid
DNA were added to each aliquot (0.5 to 5 ug). The mixture was incubated at room temperature for 10
minutes and then 1 ml of PEG (20% PEG 4000, 10 mM CaCl2, 10 mM Tris-HCl [pH 7.4]) was added to
each aliquot to promote DNA uptake. After 10 minutes at room temperature, the mixture was centrifuged for
5 minutes at 350 x g. The resulting pellet was resuspended in 150 ul of SOS (10 ml of 2 M sorbitol, 6.7 ml
of YEP [1% (wt/vol) yeast extract, 2% (wt/vol) peptone, 2% (wt/vol) glucose], 0.13 ml of 1 M CaCl2, 27 ul of
1% tryptophan and 3.7 ml of water). This mixture was incubated for 20 minutes at 30° C. The cells were
then plated, or held at 4° C for up to a few days.
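
The SOS recipe given above is stated as component volumes rather than final concentrations. As an illustration only, the following sketch totals the listed volumes and derives the resulting sorbitol and calcium chloride concentrations. It assumes the calcium salt is CaCl2 and ignores any volume change on mixing; the names used are hypothetical.

```python
# Component volumes in milliliters, as listed for SOS (27 ul = 0.027 ml).
SOS_COMPONENTS_ML = {
    "2 M sorbitol": 10.0,
    "YEP": 6.7,
    "1 M CaCl2": 0.13,
    "1% tryptophan": 0.027,
    "water": 3.7,
}


def sos_summary(components_ml):
    """Return total volume (ml), final sorbitol (M) and final CaCl2 (mM)."""
    total_ml = sum(components_ml.values())
    sorbitol_molar = 2.0 * components_ml["2 M sorbitol"] / total_ml
    cacl2_mm = 1000.0 * components_ml["1 M CaCl2"] / total_ml
    return total_ml, sorbitol_molar, cacl2_mm


total_ml, sorbitol_molar, cacl2_mm = sos_summary(SOS_COMPONENTS_ML)
print(f"total: {total_ml:.3f} ml, sorbitol: {sorbitol_molar:.2f} M, "
      f"CaCl2: {cacl2_mm:.1f} mM")
```

On these figures the mixture comes to roughly 20.6 ml, with sorbitol near 1 M, consistent with the 1 M sorbitol used in the other spheroplast buffers of this Example.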

Prior to plating the protoplast/DNA mixture, selective plates were preincubated at 37° C. Three ml of
melted top agar (45° C), composed of 18.2 ml of sorbitol, 2 gm agar, 0.6 gm Difco yeast nitrogen base
(without amino acids), 2 gm glucose, 0.1 ml of 1% adenine, 0.4 ml of 1% uracil and amino acids as
required, was then added to each aliquot of transformed cells and the tube contents poured on the selective
plates. The plates were incubated from 2 to 4 days at 30° C. Colonies which developed in the Trp minus
medium contained plasmids that have the Trp 1 gene, i.e., they were transformed.

Prior to biological assay, the transformants were grown in 20-50 ml of rich medium (1% yeast extract,
2% peptone, 2% glucose) at 33° C to stationary phase. At the time of harvest, the protease inhibitors
phenylmethylsulfonyl fluoride (PMSF) and Pepstatin A were added to a final concentration of 1 mM and 10 uM,
respectively. The cells were then removed by centrifugation at 400 x g and the medium was filtered through
a 0.45 μ cellulose acetate filter.

EXAMPLE 7 
Colony Assay

The presence of human GM-CSF harvested from the yeast cultures in Example 6 was confirmed by
assaying the ability of the supernatant to stimulate growth of human bone marrow colonies in agar. For use
in the assay, human bone marrow from the iliac crest of healthy donors was collected in a heparinized
syringe. The marrow was diluted 1:3 with phosphate buffered saline (PBS) at room temperature and layered
onto a solution of 54% Percoll (Pharmacia Fine Chemicals). After centrifugation at 500 x g at room
temperature for 20 minutes, the interface was collected and washed with 20 volumes of PBS. The
suspension was then centrifuged at 250 x g for 10 minutes at room temperature. The cells were then
resuspended in 10 ml of α-Minimal Essential Medium with nucleotides (α-MEM, Gibco) for cell counting
and viability determination. FCS was then added and the cell suspension stored on ice until the assay was
carried out.

In the assay, bone marrow cells as prepared above were added at a final concentration of 1 x 10^/ml to
an incubation medium consisting of: (a) seven parts of a solution containing 28.1% FCS, 0.7 x 10~* M
2-Mercaptoethanol, 0.12 mg/ml asparagine, 0.7 mg/ml glutamine, 150 units of penicillin G, 150 units of
streptomycin, 1.1 x α-MEM with nucleotides, and 2.2 x vitamins (Gibco); and, (b) three parts of 1.4% bacto-
agar solution (Difco). The cultures were incubated in a humidified atmosphere at 37° C in the presence of
5% CO2. After seven to fourteen days of culture, the number and types of colonies, whether granulocyte,
macrophage or mixed granulocyte-macrophage, were determined. Applicants found that the GM-CSF gene
from the pYαfGM-2 clones directed synthesis of GM-CSF activity at the high level of 1.25 x 10^ colony
forming units ("CFU") per milliliter. This activity level was determined by multiplying by 50 the reciprocal of
the dilution giving 50% of the maximum colony number. Applicants have found that the average number of
colonies from 1 x 10^ bone marrow cells was 96 ± 29. The colonies formed at 14 days by the recombinant
GM-CSF were well defined and consisted of three types: approximately 1/3 mixed granulocyte-macrophage
colonies; approximately 1/3 tight granulocyte colonies, and approximately 1/3 dispersed macrophage
colonies.
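
The activity calculation described above (50 times the reciprocal of the dilution giving 50% of the maximum colony number) can be sketched as follows. The titration data are hypothetical, and the use of linear interpolation on a log10 dilution scale to locate the half-maximal dilution is an assumption made for this illustration; the source states only the final multiplication.

```python
import math


def half_max_dilution(titration):
    """Estimate the dilution giving 50% of the maximum colony number.

    titration -- list of (dilution, colony_count) pairs, with the dilution
                 expressed as a fraction (for example 1/1000).
    Assumption: linear interpolation on a log10 dilution scale.
    """
    points = sorted(titration, key=lambda p: p[0], reverse=True)  # least dilute first
    half = max(count for _, count in points) / 2.0
    for (d_hi, c_hi), (d_lo, c_lo) in zip(points, points[1:]):
        if c_hi >= half >= c_lo and c_hi != c_lo:
            frac = (c_hi - half) / (c_hi - c_lo)
            log_d = math.log10(d_hi) + frac * (math.log10(d_lo) - math.log10(d_hi))
            return 10.0 ** log_d
    raise ValueError("colony counts never cross 50% of the maximum")


def activity_cfu_per_ml(dilution_at_half_max):
    """Activity as defined in the text: 50 times the reciprocal of the dilution."""
    return 50.0 / dilution_at_half_max


# Hypothetical titration of a supernatant.
data = [(1 / 10, 96), (1 / 100, 96), (1 / 1000, 48), (1 / 10000, 0)]
dilution = half_max_dilution(data)
print(f"half-maximal dilution: 1/{1 / dilution:.0f}")
print(f"activity: {activity_cfu_per_ml(dilution):.0f} CFU/ml")
```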

As a further control for the expression system of the present invention, a plasmid identical to
pYαfGM-2, but lacking the GM-CSF sequences, was also transformed into yeast strain 79. The culture supernatant
from the yeast produced no GM-CSF activity in the bone marrow colony assay.

As a positive control, supernatants from human placental cells, a natural source of GM-CSF, were also
tested for colony forming activity. The human placental cells were cultured at 1.2 x 10^/ml for 6 days in the
presence of 5% fetal bovine serum. When tested in the bone marrow assay, the supernatant from the
cultured placental cells was found to have an activity of approximately 5 x 10^ CFU-C/ml, and to give
approximately the same relative levels of each colony type: 1/3 mixed granulocyte-macrophage colonies,
1/3 tight granulocyte colonies and 1/3 dispersed macrophage colonies.

EXAMPLE 8 

Analysis of mRNA 

The expression of GM-CSF mRNA from a number of cell types, which have been reported previously to
express other CSF, was investigated. Northern blots of RNA from these cells were analyzed by
hybridization with a probe derived from pHG23. In this regard, total RNA for Northern blots was isolated by the
guanidinium thiocyanate/cesium chloride method as set forth supra in Example 1, from the following cells: (1)
HUT-102 cells; (2) unstimulated peripheral blood T-cells; (3) peripheral blood T-cells stimulated with Con A
and PMA as described supra in Example 1A; (4) peripheral blood macrophages stimulated with
lipopolysaccharide; (5) pancreatic carcinoma cell line 1420 (from ATCC); and, (6) bladder carcinoma cell line 5637
(from ATCC). The RNA samples were sized by electrophoresis in 1.1% agarose gels containing
formaldehyde to denature the RNA so that the rate of migration of the RNA through the gel was in proportion to
its molecular weight. A standard protocol for electrophoresis of the RNA through agarose gels containing
formaldehyde is set forth in Maniatis et al., supra at 202.

After electrophoresis the formaldehyde-denatured RNA was transferred to nitrocellulose filters using a
standard protocol as detailed in Maniatis et al., supra at 203, for subsequent hybridization with a 32P-labeled
RNA probe transcribed in vitro by SP6 polymerase by the procedure set forth in Green et al., 32 Cell 681
(March 1981). The 32P-RNA probe was synthesized from the 600 base pair Pst I to Nco I fragment of
pHG23 which was subcloned into the pSP64 vector (Promega Biotech). The RNA bound to the
nitrocellulose filters was hybridized with the labeled RNA probe (10^ cpm/ml) for 16 hours at 63° C in Stark's
complete buffer: 5 x SSC; 50 mM KH2PO4 [pH 2.5]; 150 ug/ml denatured salmon sperm DNA; 2 x
Denhardt's solution [0.04% (wt/vol) Ficoll, 0.04% (wt/vol) polyvinyl pyrrolidone, 0.04% (wt/vol) BSA]; 0.1%
SDS; 20 mM Na2 EDTA; and, 50% (wt/vol) formamide. After hybridization, the filter was washed at 63° C in 6
x SSC for two hours and in 0.1 x SSC for two hours and then autoradiographed for four hours with
intensifying screens at -70° C.

The results of the autoradiography are set forth in FIGURE 4, which shows strong hybridization to a
band of approximately 900 nucleotides in 5 ug of total RNA from PMA and Con A activated peripheral blood
T-cells and from HUT-102 cells (lanes 3 and 1). A low level of hybridization was seen in 1.5 ug of
polyadenylated RNA from the human bladder carcinoma cell line 5637 (lane 6). No hybridization was seen in
5 ug of total RNA from unstimulated peripheral blood T-cells (lane 2), 1.5 ug of polyadenylated RNA from
lipopolysaccharide-stimulated peripheral blood macrophages (lane 4) and 1.5 ug of polyadenylated RNA
from human pancreatic carcinoma cell line 1420 (lane 5). The other high molecular weight bands shown in
FIGURE 4, 18S and 28S, are due to the hybridization of the probe to ribosomal RNA. The results from the
Northern blot analysis are consistent with the high level of biological activity found to be associated with
GM-CSF derived from activated peripheral blood T-cells and HUT-102 cells.

EXAMPLE 9 

Analysis of Human Genomic Sequences 



To determine the number of GM-CSF-related genes in human genomic DNA, a labeled GM-CSF cDNA
probe was hybridized to Southern blots of human DNA digested with restriction enzymes expected to cut
relatively infrequently. The probe included the entire pHG23 sequence (Figure 2) 3' of nucleotide number
161 and was radiolabeled by nick translation by the standard procedure set forth in Maniatis et al., supra at
108 and discussed at page 10 herein. Prior to hybridization, 10 µg of human genomic DNA was digested to
completion with Hind III, Eco RI, Pst I, or Bgl II using standard techniques. The digested human DNA was
fractionated by electrophoresis in a 0.7% agarose gel with markers of appropriate size. The agarose gel
was blotted onto nitrocellulose filters using the method described by Southern, supra. After the transfer
process the filter was air-dried and baked for two hours at 80°C to bind the DNA fragments to the
nitrocellulose. Thereafter, the bound DNA was hybridized with the labeled cDNA probe as set forth in
Example 4, supra, then washed extensively in 2 x SSC, 0.5% SDS at room temperature followed by
washing in 0.1 x SSC, 0.5% SDS for 45 minutes at 65°C. After air drying, the filter was subjected to
autoradiography at -70°C.
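
As a rough guide to the two wash steps described above, the salt concentration at each SSC dilution can be worked out. The sketch below assumes the conventional definition of 1 x SSC (0.15 M sodium chloride, 0.015 M sodium citrate), which this specification does not restate; the function name is illustrative only.

```python
# Assumption (not stated in this specification): 1 x SSC is taken to be
# 0.15 M NaCl and 0.015 M sodium citrate, the conventional definition.
NACL_M_PER_1X = 0.15
CITRATE_M_PER_1X = 0.015


def ssc_molarity(fold):
    """Return (NaCl, sodium citrate) molarity for an n-fold SSC solution."""
    return (fold * NACL_M_PER_1X, fold * CITRATE_M_PER_1X)


# The room-temperature wash (2 x SSC) and the 65 degree C wash (0.1 x SSC).
for label, fold in [("2 x SSC", 2.0), ("0.1 x SSC", 0.1)]:
    nacl, citrate = ssc_molarity(fold)
    print(f"{label}: {nacl:.3f} M NaCl, {citrate:.4f} M sodium citrate")
```

The second wash combines lower salt with higher temperature, which makes it the more stringent of the two steps.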

The results of autoradiography are set forth in FIGURE 5. In FIGURE 5 the molecular weight markers (in
kilobase pairs) are from Hind III-digested bacteriophage λ DNA. As shown in FIGURE 5, human DNA
digested with Hind III, Eco RI, and Pst I (lanes 1, 2, 3, respectively) resulted in single bands, while digestion
with Bgl II gave rise to two bands. On this basis, it appears that GM-CSF exists as a single copy gene in
human genomic DNA.
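
The reasoning behind the band count can be illustrated with a toy model. The sketch below is not taken from this specification: the locus length, cut-site positions and probe interval are invented, and the function name is hypothetical. It only shows that, for a single-copy locus, an enzyme that does not cut within the probed region yields one hybridizing fragment, whereas an enzyme with one site inside the probed region yields two, which is one way a two-band pattern such as that seen with Bgl II can arise.

```python
def hybridizing_fragments(length, cut_sites, probe_start, probe_end):
    """Return the (start, end) fragments of a linear DNA that overlap the probe.

    Coordinates are 0-based and half-open; cut_sites are cutting positions.
    """
    bounds = [0] + sorted(cut_sites) + [length]
    fragments = list(zip(bounds[:-1], bounds[1:]))
    # A fragment gives a band if it shares at least one base with the probed region.
    return [(s, e) for (s, e) in fragments if s < probe_end and e > probe_start]


# Invented example: a 20,000 bp single-copy locus probed over positions 8,000-10,000.
LOCUS = 20_000
PROBE = (8_000, 10_000)

# Enzyme whose sites all lie outside the probed region.
outside_only = hybridizing_fragments(LOCUS, [3_000, 15_000], *PROBE)
# Enzyme with one additional site inside the probed region.
one_inside = hybridizing_fragments(LOCUS, [3_000, 9_000, 15_000], *PROBE)

print(len(outside_only), outside_only)
print(len(one_inside), one_inside)
```

A multi-copy gene would instead be expected to give additional bands with most enzymes, since each copy sits in a different flanking context.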

Claims 

Claims for the following Contracting States: BE, FR, DE, IT, LU, NL, SE, CH, LI, GB

1. A recombinant DNA expression vector comprising a promoter that directs expression in a yeast host of
a recombinant DNA coding sequence which:

(a) encodes a mature human granulocyte-macrophage colony stimulating factor (GM-CSF) protein
having N-terminal alanine-proline residues, which protein is capable of stimulating growth of human
bone marrow colonies in the human bone marrow colony assay;

(b) hybridizes to a radiolabeled single-stranded DNA probe consisting of a Pst I-Hae III fragment of
the murine GM-CSF gene corresponding to nucleotides 45 through 400 indicated in Figure 1, after
overnight hybridization in 6xSSC at 55°C followed by washing with 6xSSC; and

(c) is fused at its 5' terminus to a leader sequence derived from a yeast mating pheromone gene
that directs secretion of said mature human GM-CSF protein into culture medium upon cleavage of
such leader from said N-terminal alanine-proline residues.

2. A recombinant DNA expression vector according to claim 1, wherein the recombinant DNA coding
sequence encodes a mature human GM-CSF protein having the amino acid sequence:

Ala Pro Ala Arg Ser Pro Ser Pro Ser Thr
Gln Pro Trp Glu His Val Asn Ala Ile Gln
Glu Ala Arg Arg Leu Leu Asn Leu Ser Arg
Asp Thr Ala Ala Glu Met Asn Glu Thr Val
Glu Val Ile Ser Glu Met Phe Asp Leu Gln
Glu Pro Thr Cys Leu Gln Thr Arg Leu Glu
Leu Tyr Lys Gln Gly Leu Arg Gly Ser Leu
Thr Lys Leu Lys Gly Pro Leu Thr Met Met
Ala Ser His Tyr Lys Gln His Cys Pro Pro
Thr Pro Glu Thr Ser Cys Ala Thr Gln Ile
Ile Thr Phe Glu Ser Phe Lys Glu Asn Leu
Lys Asp Phe Leu Leu Val Ile Pro Phe Asp
Cys Trp Glu Pro Val Gln Glu.









3. A recombinant DNA expression vector according to claim 2, wherein the recombinant DNA coding
sequence consists of the nucleotide sequence:

GCA CCC GCC CGC TCG CCC AGC CCC AGC ACA
CAG CCC TGG GAA CAT GTG AAT GCC ATC CAG
GAA GCC CGG CGT CTC CTG AAC CTG AGT AGA
GAC ACT GCT GCT GAG ATG AAT GAA ACA GTA
GAA GTC ATC TCA GAA ATG TTT GAC CTC CAG
GAG CCG ACC TGC CTA CAG ACC CGC CTG GAG
CTG TAC AAG CAG GGC CTG CGG GGC AGC CTC
ACC AAG CTC AAG GGC CCC TTG ACC ATG ATG
GCC AGC CAC TAC AAA CAG CAC TGC CCT CCA
ACC CCG GAA ACT TCC TGT GCA ACC CAG ATT
ATC ACC TTT GAA AGT TTC AAA GAG AAC CTG
AAG GAC TTT CTG CTT GTC ATC CCC TTT GAC
TGC TGG GAG CCA GTC CAG GAG.









4. A recombinant DNA expression vector according to any of claims 1 to 3, wherein the DNA encoding the
human GM-CSF protein is linked to the yeast α-factor leader sequence.

5. The recombinant DNA expression vector pYαfGM-2 (ATCC 53157).

6. A yeast host cell transformed with a recombinant DNA expression vector according to any of claims 1
to 5.

7. A process for producing a mature human granulocyte-macrophage colony stimulating factor (GM-CSF)
protein, comprising growing a yeast host according to claim 6 in a culture medium and recovering the
human GM-CSF protein from the culture medium.

Claims for the following Contracting State: AT 

1. A process for the preparation of a recombinant DNA expression vector comprising a promoter that
directs expression in a yeast host of a recombinant DNA coding sequence which:

(a) encodes a mature human granulocyte-macrophage colony stimulating factor (GM-CSF) protein
having N-terminal alanine-proline residues, which protein is capable of stimulating growth of human
bone marrow colonies in the human bone marrow colony assay;

(b) hybridizes to a radiolabeled single-stranded DNA probe consisting of a Pst I-Hae III fragment of
the murine GM-CSF gene corresponding to nucleotides 45 through 400 indicated in Figure 1, after
overnight hybridization in 6xSSC at 55°C followed by washing with 6xSSC; and

(c) is fused at its 5' terminus to a leader sequence derived from a yeast mating pheromone gene
that directs secretion of said mature human GM-CSF protein into culture medium upon cleavage of
such leader from said N-terminal alanine-proline residues;

the process comprising coupling together successive nucleotides.

2. A process according to claim 1, wherein the recombinant DNA coding sequence encodes a mature
human GM-CSF protein having the amino acid sequence:

Ala Pro Ala Arg Ser Pro Ser Pro Ser Thr
Gln Pro Trp Glu His Val Asn Ala Ile Gln
Glu Ala Arg Arg Leu Leu Asn Leu Ser Arg
Asp Thr Ala Ala Glu Met Asn Glu Thr Val
Glu Val Ile Ser Glu Met Phe Asp Leu Gln
Glu Pro Thr Cys Leu Gln Thr Arg Leu Glu
Leu Tyr Lys Gln Gly Leu Arg Gly Ser Leu
Thr Lys Leu Lys Gly Pro Leu Thr Met Met
Ala Ser His Tyr Lys Gln His Cys Pro Pro
Thr Pro Glu Thr Ser Cys Ala Thr Gln Ile
Ile Thr Phe Glu Ser Phe Lys Glu Asn Leu
Lys Asp Phe Leu Leu Val Ile Pro Phe Asp
Cys Trp Glu Pro Val Gln Glu.









3. A process according to claim 2, wherein the recombinant DNA coding sequence consists of the
nucleotide sequence:

GCA CCC GCC CGC TCG CCC AGC CCC AGC ACA
CAG CCC TGG GAA CAT GTG AAT GCC ATC CAG
GAA GCC CGG CGT CTC CTG AAC CTG AGT AGA
GAC ACT GCT GCT GAG ATG AAT GAA ACA GTA
GAA GTC ATC TCA GAA ATG TTT GAC CTC CAG
GAG CCG ACC TGC CTA CAG ACC CGC CTG GAG
CTG TAC AAG CAG GGC CTG CGG GGC AGC CTC
ACC AAG CTC AAG GGC CCC TTG ACC ATG ATG
GCC AGC CAC TAC AAA CAG CAC TGC CCT CCA
ACC CCG GAA ACT TCC TGT GCA ACC CAG ATT
ATC ACC TTT GAA AGT TTC AAA GAG AAC CTG
AAG GAC TTT CTG CTT GTC ATC CCC TTT GAC
TGC TGG GAG CCA GTC CAG GAG.









4. A process according to any of claims 1 to 3, wherein the DNA encoding the human GM-CSF protein is
linked to the yeast α-factor leader sequence.

5. A process for the preparation of the recombinant DNA expression vector pYαfGM-2 (ATCC 53157), the
process comprising coupling together successive nucleotides.

6. A process for the preparation of a transformed yeast host cell, the process comprising transforming a
yeast host cell with a recombinant DNA expression vector according to any of claims 1 to 5.

7. A process for producing a mature human granulocyte-macrophage colony stimulating factor (GM-CSF)
protein, comprising growing a yeast host prepared by a process according to claim 6 in a culture
medium and recovering the human GM-CSF protein from the culture medium.

Claims (Revendications)

Claims for the following Contracting States: BE, FR, DE, IT, LU, NL, SE, CH, LI, GB

1. A recombinant DNA expression vector comprising a promoter that directs expression in a yeast host
cell of a recombinant DNA coding sequence which:

(a) encodes a mature human granulocyte-macrophage colony stimulating factor (GM-CSF) protein
having N-terminal alanine-proline residues, which protein is capable of stimulating the growth of
human bone marrow colonies in the human bone marrow colony assay;

(b) hybridizes to a radiolabeled single-stranded DNA probe consisting of a Pst I-Hae III fragment of
the murine GM-CSF gene corresponding to nucleotides 45 through 400 indicated in Figure 1, after
overnight hybridization in 6xSSC at 55°C followed by washing with 6xSSC; and

(c) is fused at its 5' terminus to a leader sequence derived from a yeast mating pheromone gene
that directs secretion of said mature human GM-CSF protein into the culture medium upon cleavage
of this leader from said N-terminal alanine-proline residues.

2. A recombinant DNA expression vector according to claim 1, wherein the recombinant DNA coding
sequence encodes a mature human GM-CSF protein having the following amino acid sequence:

Ala Pro Ala Arg Ser Pro Ser Pro Ser Thr
Gln Pro Trp Glu His Val Asn Ala Ile Gln
Glu Ala Arg Arg Leu Leu Asn Leu Ser Arg
Asp Thr Ala Ala Glu Met Asn Glu Thr Val
Glu Val Ile Ser Glu Met Phe Asp Leu Gln
Glu Pro Thr Cys Leu Gln Thr Arg Leu Glu
Leu Tyr Lys Gln Gly Leu Arg Gly Ser Leu
Thr Lys Leu Lys Gly Pro Leu Thr Met Met
Ala Ser His Tyr Lys Gln His Cys Pro Pro
Thr Pro Glu Thr Ser Cys Ala Thr Gln Ile
Ile Thr Phe Glu Ser Phe Lys Glu Asn Leu
Lys Asp Phe Leu Leu Val Ile Pro Phe Asp
Cys Trp Glu Pro Val Gln Glu.









3. A recombinant DNA expression vector according to claim 2, wherein the recombinant DNA coding
sequence consists of the following nucleotide sequence:

GCA CCC GCC CGC TCG CCC AGC CCC AGC ACA
CAG CCC TGG GAA CAT GTG AAT GCC ATC CAG
GAA GCC CGG CGT CTC CTG AAC CTG AGT AGA
GAC ACT GCT GCT GAG ATG AAT GAA ACA GTA
GAA GTC ATC TCA GAA ATG TTT GAC CTC CAG
GAG CCG ACC TGC CTA CAG ACC CGC CTG GAG
CTG TAC AAG CAG GGC CTG CGG GGC AGC CTC
ACC AAG CTC AAG GGC CCC TTG ACC ATG ATG
GCC AGC CAC TAC AAA CAG CAC TGC CCT CCA
ACC CCG GAA ACT TCC TGT GCA ACC CAG ATT
ATC ACC TTT GAA AGT TTC AAA GAG AAC CTG
AAG GAC TTT CTG CTT GTC ATC CCC TTT GAC
TGC TGG GAG CCA GTC CAG GAG.









4. A recombinant DNA expression vector according to any one of claims 1 to 3, wherein the DNA
encoding the human GM-CSF protein is linked to the yeast α-factor leader sequence.

5. The recombinant DNA expression vector pYαfGM-2 (ATCC 53157).

6. A yeast host cell transformed with a recombinant DNA expression vector according to any one of
claims 1 to 5.

7. A process for preparing a mature human granulocyte-macrophage colony stimulating factor protein,
comprising growing a yeast host according to claim 6 in a culture medium and recovering the human
GM-CSF protein from the culture medium.

Claims for the following Contracting State: AT

1. A process for the preparation of a recombinant DNA expression vector comprising a promoter that
directs expression in a yeast host of a recombinant DNA coding sequence which:

(a) encodes a mature human granulocyte-macrophage colony stimulating factor (GM-CSF) protein
having N-terminal alanine-proline residues, which protein is capable of stimulating the growth of
human bone marrow colonies in the human bone marrow colony assay;

(b) hybridizes to a radiolabeled single-stranded DNA probe consisting of a Pst I-Hae III fragment of
the murine GM-CSF gene corresponding to nucleotides 45 through 400 indicated in Figure 1, after
overnight hybridization in 6xSSC at 55°C followed by washing with 6xSSC; and

(c) is fused at its 5' terminus to a leader sequence derived from a yeast mating pheromone gene
that directs secretion of said mature human GM-CSF protein into the culture medium upon cleavage
of this leader from said N-terminal alanine-proline residues;

the process comprising coupling together successive nucleotides.

2. A process according to claim 1, wherein the recombinant DNA coding sequence encodes a mature
human GM-CSF protein having the following amino acid sequence:

Ala Pro Ala Arg Ser Pro Ser Pro Ser Thr
Gln Pro Trp Glu His Val Asn Ala Ile Gln
Glu Ala Arg Arg Leu Leu Asn Leu Ser Arg
Asp Thr Ala Ala Glu Met Asn Glu Thr Val
Glu Val Ile Ser Glu Met Phe Asp Leu Gln
Glu Pro Thr Cys Leu Gln Thr Arg Leu Glu
Leu Tyr Lys Gln Gly Leu Arg Gly Ser Leu
Thr Lys Leu Lys Gly Pro Leu Thr Met Met
Ala Ser His Tyr Lys Gln His Cys Pro Pro
Thr Pro Glu Thr Ser Cys Ala Thr Gln Ile
Ile Thr Phe Glu Ser Phe Lys Glu Asn Leu
Lys Asp Phe Leu Leu Val Ile Pro Phe Asp
Cys Trp Glu Pro Val Gln Glu.









3. A process according to claim 2, wherein the recombinant DNA coding sequence consists of the
following nucleotide sequence:

GCA CCC GCC CGC TCG CCC AGC CCC AGC ACA
CAG CCC TGG GAA CAT GTG AAT GCC ATC CAG
GAA GCC CGG CGT CTC CTG AAC CTG AGT AGA
GAC ACT GCT GCT GAG ATG AAT GAA ACA GTA
GAA GTC ATC TCA GAA ATG TTT GAC CTC CAG
GAG CCG ACC TGC CTA CAG ACC CGC CTG GAG
CTG TAC AAG CAG GGC CTG CGG GGC AGC CTC
ACC AAG CTC AAG GGC CCC TTG ACC ATG ATG
GCC AGC CAC TAC AAA CAG CAC TGC CCT CCA
ACC CCG GAA ACT TCC TGT GCA ACC CAG ATT
ATC ACC TTT GAA AGT TTC AAA GAG AAC CTG
AAG GAC TTT CTG CTT GTC ATC CCC TTT GAC
TGC TGG GAG CCA GTC CAG GAG.









4. A process according to any one of claims 1 to 3, wherein the DNA encoding the human GM-CSF
protein is linked to the yeast α-factor leader sequence.

5. A process for the preparation of the recombinant DNA expression vector pYαfGM-2 (ATCC 53157),
the process comprising coupling together successive nucleotides.

6. A process for the preparation of a transformed yeast host cell, the process comprising
transforming a yeast host cell with a recombinant DNA expression vector according to any one of
claims 1 to 5.

7. A process for preparing a mature human granulocyte-macrophage colony stimulating factor (GM-CSF)
protein, comprising growing a yeast host prepared by a process according to claim 6 in a culture
medium and recovering the human GM-CSF protein from the culture medium.

Claims (Patentansprüche)

Claims for the following Contracting States: BE, FR, DE, IT, LU, NL, SE, CH, LI, GB

1. An expression vector for the expression of recombinant DNA, which contains a promoter that in turn
directs, in a yeast host cell, the expression of a recombinant DNA coding sequence, which coding
sequence:

(a) codes for a protein of the mature human granulocyte-macrophage colony stimulating factor
(GM-CSF), which protein has N-terminal alanine-proline residues and is capable of stimulating the
growth of human bone marrow colonies in the human bone marrow colony assay;

(b) hybridizes to a radioactively labeled, single-stranded DNA probe consisting of a Pst I-Hae III
fragment of the murine GM-CSF gene, which fragment corresponds to nucleotides 45 to 400 shown in
Fig. 1, after overnight hybridization in 6 x SSC at 55°C and subsequent washing with 6 x SSC; and

(c) is fused at its 5' terminus to a leader sequence which originates from a corresponding yeast
pheromone gene and which directs the secretion of the mature human GM-CSF protein into the
culture medium after cleavage of the leader sequence from the N-terminal alanine-proline residues.

EP 0 183 350 B1

2. An expression vector for the expression of recombinant DNA according to claim 1, wherein the
recombinant DNA coding sequence codes for a protein of mature human GM-CSF, which protein has the
amino acid sequence:

Ala Pro Ala Arg Ser Pro Ser Pro Ser Thr
Gln Pro Trp Glu His Val Asn Ala Ile Gln
Glu Ala Arg Arg Leu Leu Asn Leu Ser Arg
Asp Thr Ala Ala Glu Met Asn Glu Thr Val
Glu Val Ile Ser Glu Met Phe Asp Leu Gln
Glu Pro Thr Cys Leu Gln Thr Arg Leu Glu
Leu Tyr Lys Gln Gly Leu Arg Gly Ser Leu
Thr Lys Leu Lys Gly Pro Leu Thr Met Met
Ala Ser His Tyr Lys Gln His Cys Pro Pro
Thr Pro Glu Thr Ser Cys Ala Thr Gln Ile
Ile Thr Phe Glu Ser Phe Lys Glu Asn Leu
Lys Asp Phe Leu Leu Val Ile Pro Phe Asp
Cys Trp Glu Pro Val Gln Glu.

3. An expression vector for the expression of recombinant DNA according to claim 2, wherein the
recombinant DNA coding sequence consists of the following nucleotide sequence:

GCA CCC GCC CGC TCG CCC AGC CCC AGC ACA
CAG CCC TGG GAA CAT GTG AAT GCC ATC CAG
GAA GCC CGG CGT CTC CTG AAC CTG AGT AGA
GAC ACT GCT GCT GAG ATG AAT GAA ACA GTA
GAA GTC ATC TCA GAA ATG TTT GAC CTC CAG
GAG CCG ACC TGC CTA CAG ACC CGC CTG GAG
CTG TAC AAG CAG GGC CTG CGG GGC AGC CTC
ACC AAG CTC AAG GGC CCC TTG ACC ATG ATG
GCC AGC CAC TAC AAA CAG CAC TGC CCT CCA
ACC CCG GAA ACT TCC TGT GCA ACC CAG ATT
ATC ACC TTT GAA AGT TTC AAA GAG AAC CTG
AAG GAC TTT CTG CTT GTC ATC CCC TTT GAC
TGC TGG GAG CCA GTC CAG GAG.

4. An expression vector for the expression of recombinant DNA according to any one of claims 1 to 3,
wherein the DNA coding for the human GM-CSF protein is attached to the yeast alpha-factor leader
sequence.

5. The expression vector pYαfGM-2 (ATCC 53157) for the expression of recombinant DNA.

6. A yeast host cell which has been transformed with an expression vector for the expression of
recombinant DNA according to any one of claims 1 to 5.

7. A process for producing a protein of the mature human granulocyte-macrophage colony stimulating
factor (GM-CSF), which comprises culturing yeast host cells according to claim 6 in a culture
medium and recovering the human GM-CSF protein from the culture medium.

Claims for the following Contracting State: AT

1. A process for producing an expression vector for the expression of recombinant DNA, which contains
a promoter that in turn directs, in a yeast host cell, the expression of a recombinant DNA coding
sequence, which coding sequence:

(a) codes for a protein of the mature human granulocyte-macrophage colony stimulating factor
(GM-CSF), which protein has N-terminal alanine-proline residues and is capable of stimulating the
growth of human bone marrow colonies in the human bone marrow colony assay;

(b) hybridizes to a radioactively labeled, single-stranded DNA probe consisting of a Pst I-Hae III
fragment of the murine GM-CSF gene, which fragment corresponds to nucleotides 45 to 400 shown in
Fig. 1, after overnight hybridization in 6 x SSC at 55°C and subsequent washing with 6 x SSC; and

(c) is fused at its 5' terminus to a leader sequence which originates from a corresponding yeast
pheromone gene and which directs the secretion of the mature human GM-CSF protein into the
culture medium after cleavage of the leader sequence from the N-terminal alanine-proline residues;

and which process comprises coupling together successive nucleotides.

2. A process according to claim 1, wherein the recombinant DNA coding sequence codes for a protein of
mature human GM-CSF, which protein has the amino acid sequence:

Ala Pro Ala Arg Ser Pro Ser Pro Ser Thr
Gln Pro Trp Glu His Val Asn Ala Ile Gln
Glu Ala Arg Arg Leu Leu Asn Leu Ser Arg
Asp Thr Ala Ala Glu Met Asn Glu Thr Val
Glu Val Ile Ser Glu Met Phe Asp Leu Gln
Glu Pro Thr Cys Leu Gln Thr Arg Leu Glu
Leu Tyr Lys Gln Gly Leu Arg Gly Ser Leu
Thr Lys Leu Lys Gly Pro Leu Thr Met Met
Ala Ser His Tyr Lys Gln His Cys Pro Pro
Thr Pro Glu Thr Ser Cys Ala Thr Gln Ile
Ile Thr Phe Glu Ser Phe Lys Glu Asn Leu
Lys Asp Phe Leu Leu Val Ile Pro Phe Asp
Cys Trp Glu Pro Val Gln Glu.



3. A process according to claim 2, wherein the recombinant DNA coding sequence consists of the
following nucleotide sequence:

GCA CCC GCC CGC TCG CCC AGC CCC AGC ACA
CAG CCC TGG GAA CAT GTG AAT GCC ATC CAG
GAA GCC CGG CGT CTC CTG AAC CTG AGT AGA
GAC ACT GCT GCT GAG ATG AAT GAA ACA GTA
GAA GTC ATC TCA GAA ATG TTT GAC CTC CAG
GAG CCG ACC TGC CTA CAG ACC CGC CTG GAG
CTG TAC AAG CAG GGC CTG CGG GGC AGC CTC
ACC AAG CTC AAG GGC CCC TTG ACC ATG ATG
GCC AGC CAC TAC AAA CAG CAC TGC CCT CCA
ACC CCG GAA ACT TCC TGT GCA ACC CAG ATT
ATC ACC TTT GAA AGT TTC AAA GAG AAC CTG
AAG GAC TTT CTG CTT GTC ATC CCC TTT GAC
TGC TGG GAG CCA GTC CAG GAG.









4. A process according to any one of claims 1 to 3, wherein the DNA coding for the human GM-CSF
protein is attached to the yeast alpha-factor leader sequence.

5. A process for producing the expression vector pYαfGM-2 (ATCC 53157) for the expression of
recombinant DNA, which process comprises coupling together successive nucleotides.

6. A process for producing a transformed yeast host cell, which process comprises transforming a
yeast host cell with an expression vector for the expression of recombinant DNA according to any one
of claims 1 to 5.

7. A process for producing a protein of the mature human granulocyte-macrophage colony stimulating
factor (GM-CSF), which comprises culturing yeast host cells which have been produced by a process
according to claim 6 in a culture medium, and recovering the human GM-CSF protein from the culture
medium.
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'° ^? c c C . ^? 



TCTCCCAGACTCCTGCGTCCGCGACTCCCATAATTCTTACCCCGCCTTGCAAnCATGTACACrCC 
ScrGltiSerArgGlyTrpAlaSerGlyl 1 cThrValThrAr gProTrpLysH i sVa IGI nAla 

K 

70 ^ 90 ,110 130 

• G • A • 

ATCAAACAAGCCCTAAACCTCCTGGATGACATGCCTGTCACGTTCAATGAAGAGCTAGAACTCGT 
I 1 cty sGl iiA 1 aLrtiA 5nl.cul,eviAs pA r. pMc tP r oVa 1 Til rLeuA s nGl uGl uVa 1 G 1 nVn 1 Va 

150 170 190 

• • • 

CTCTAACGAGTTCTCCTTCAAGAAGCTAACATGTGTGCAGACCCCCCTGAAGATATTCCAGCAGG 
ISerAsnGluPhcSerPhcL'ysLysLcuThrCysValGlnThrArgLeuLysllePhcGluGlnC 

210 230 250 

GTCTACCACCGGGCAATTTCACCAAACTCAAGGGCCCCTTGAACATGACAGCCAGCTACTACCAC 
lyLeuProArgGlyAsnPheThrtysLeuLysGlyAlaLeuAsnMetThrAlaSerTyrTyrGln 
GM-CSF cDNA Screening Probe Sequence

270 290 310 

• • « 

ACATACTGCCCCCCAACTCCGGAAACGGACTCTGAAACACAAGTCACCACCTATGCGGATTTCAT 

ThrTyrCysP r oP roTh rProG 1 uThrAs pCy sG luThrGl nVa IThrThrTyr Al aAs pPhe 1 1 

330 350 370 390 

• ♦ • • 

AGACACCCTTAAAACCTTTCTGACTGATATCCCCTTTGAATGCAAAAAACCAAGCCAAAAATCAG 
eAspSerLcuLysThrPheLeuThrAspIleProPhcGluCysLysLysProSerGlnLysEnd 

410 430 450

« • • 

CAAGCCCAGGCCAGCTCTGAATCCAGCTTCTCAGACTGCTGCTTTTGTGCCTGCGTAATGAGCCA 

^ 

470 490 510 - 

c C . . . 

AGAACTTGGAATTTCTGCCTTAAAGGGACCAAGAGATGTGCCACAGCCACAGTTCGAAGGCAGTA 

530 ^ 550 570 

TACCCCTCTGAAAACCCTAACTCAGCTTGGACAGCGGAACACAAACGAGAGATATTTTCtACTGA 

590 610 630 650 

• • • • 

TAGGGACCATTATATTTATTTATATATTTATATTTTTTAAATATTATTTATTTATTTATTTATTT 

670 690 710 

• • • 

TTCCAACTCTATTTATTGAGAATGTCTTACCAGAATAATAAATTATTAAAACTTTAAAAAAAAAA 



AAAAAAAAAA 



Nucleotide sequence (upper line) and amino acid sequence (lower line) of
murine granulocyte-macrophage colony stimulating factor gene. Gough et al., 309 Nature
(London) 763 (1984). Plasmid DNA fragment isolated with synthetic oligonucleotide probe
extends from nucleotide Nos. 1 through 600. Overlined portions (nucleotide Nos. 1-28 and
201-203) contained in isolated plasmid DNA but not in Gough et al. Nucleotide No. 505
not contained in isolated plasmid DNA. Other differences in the nucleotide sequence of
the isolated plasmid DNA are shown above corresponding nucleotides of the gene in Gough
et al. The portion of the DNA fragment used to probe human cDNA library is indicated by
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10 I r^^* 30 50 70 



CTCCAGCATCTCTGCACCCGCCCCCTCGCCCACCCCCACCACACAGCCCTGGCACCATGTCAATGCCATCCAGGAG 
CysSerIleSerAlaProAlaArgSerProSerProSerThrGlnProTrpGluHisValAsnAlaIleGlnGlu

10 20 

90 110 130 150 

• ••••••• 

GCCCGCCCTCTCCTGAACCTGAGTAGAGACACTGCTGCTGAGATGAATGAAACACTAGAAGTCATCTCAGAAATG 
AlaArgArgLeuLeuAsnLeuSerArgAspThrAlaAlaGluMetAsnGluThrValGluValIleSerGluMet

30 40

170 190 210 

• • • • • • ■ 

TTTGACCTCCAGGAGCCGACCTGCCTACAGACCCGCCTGGAGCTGTACAACCAGGGCCTGCGGGGCACCCTCACC 
PheAspLeuGlnGluProThrCysLeuGlnThrArgLeuGluLeuTyrLysGlnGlyLeuArgGlySerLeuThr
50 60 70 

230 250 270 290 

• ••>•• 

AAGCTCAAGGGCCCCTTGACCATGATGGCCAGCCACTACAAACAGCACTGCCCTCCAACCCCGGAAACTTCCTGT 
LysLeuLysGlyProLeuThrMetMetAlaSerHisTyrLysGlnHisCysProProThrProGluThrSerCys

80 90 

310 330 350 370 

GCAACCCACATTATCACCTTTGAAAGTTTCAAAGAGAACCTGAAGGACTTTCTGCTTGTCATCCCCTTTGACTGCTGG 
AlaThrGlnIleIleThrPheGluSerPheLysGluAsnLeuLysAspPheLeuLeuValIleProPheAspCysTrp
100 110 120 

390 410 430 450

CAGCCAGTCCAGGAGTGAGACCGGCCAGATGAGGCTGGCCAAGCCGGCGAGCTGCTCTCTCATGAAACAAGAGCTAG 
GluProValGlnGluEnd 

470 490 510 530 

• ■•*>••• 

AAACTCAGGATGGTCATCTTGGAGGGACCAAGGGGTGGGCCACACCCATGGTGGGAGTGGCCTGCACTGCCTGGCCA 

Nco I 

550 570 590 

CACTCACCTGATACAGCCATGGCAGAAGAATGGCATATTTATACTGACAAATACTGATATrATATATTATATTT 

610 630 650 

• ■*••■ 

TAAATAATTTAATTTAATTTAATTTAATTTAATTGACTAATTACTATTATTACG 
PIP p 

FIG. 2 Nucleotide sequence (upper line) and amino acid sequence
(lower line) for human plasmid DNA pHG23 coding for
granulocyte-macrophage colony stimulating factor gene. Mature protein begins at
asterisk (*).
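
The relationship shown in the figure, nucleotide sequence above and amino acid sequence below, can be checked mechanically for the mature coding region. The sketch below translates the 127-codon coding sequence as recited in claim 3 with the standard genetic code and compares the result with the mature protein sequence of claim 2, written here in one-letter code. The variable and function names are illustrative and not part of the specification.

```python
from itertools import product

# Standard genetic code, listed in TCAG order (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Coding sequence for mature human GM-CSF as recited in claim 3 (ten codons per row).
CODING = """
GCA CCC GCC CGC TCG CCC AGC CCC AGC ACA
CAG CCC TGG GAA CAT GTG AAT GCC ATC CAG
GAA GCC CGG CGT CTC CTG AAC CTG AGT AGA
GAC ACT GCT GCT GAG ATG AAT GAA ACA GTA
GAA GTC ATC TCA GAA ATG TTT GAC CTC CAG
GAG CCG ACC TGC CTA CAG ACC CGC CTG GAG
CTG TAC AAG CAG GGC CTG CGG GGC AGC CTC
ACC AAG CTC AAG GGC CCC TTG ACC ATG ATG
GCC AGC CAC TAC AAA CAG CAC TGC CCT CCA
ACC CCG GAA ACT TCC TGT GCA ACC CAG ATT
ATC ACC TTT GAA AGT TTC AAA GAG AAC CTG
AAG GAC TTT CTG CTT GTC ATC CCC TTT GAC
TGC TGG GAG CCA GTC CAG GAG
"""

# Amino acid sequence of claim 2 in one-letter code (ten residues per row).
PROTEIN = "".join("""
APARSPSPST QPWEHVNAIQ EARRLLNLSR DTAAEMNETV
EVISEMFDLQ EPTCLQTRLE LYKQGLRGSL TKLKGPLTMM
ASHYKQHCPP TPETSCATQI ITFESFKENL KDFLLVIPFD
CWEPVQE
""".split())


def translate(codon_list):
    """Translate a list of DNA codons into a one-letter protein string."""
    return "".join(CODON_TABLE[codon] for codon in codon_list)


codons = CODING.split()
translated = translate(codons)
print(len(codons), "codons")
print("matches claim 2 sequence:", translated == PROTEIN)
```

The same check can be pointed at any other reading of the sequence, for example a retyped copy of the figure, to locate transcription errors codon by codon.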
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A GCJ^JCJ TIG CAT AAA AOA GC" 
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ser leu asp /ys org • o/o pro 



α-FACTOR
PROCESSING



FIG.3 
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LANES 

1 2 3 4 5 6



28S 





FIG. 4 

Hybridization of GM-CSF probe to Northern blots of RNA from human
cells. Lane 1, 5 µg of total RNA from HUT-102 cells. Lane 2, 5 µg of total RNA
from unstimulated peripheral blood T cells. Lane 3, 5 µg of total RNA from
peripheral blood T cells stimulated with concanavalin A and phorbol myristic
acetate. Lane 4, 1.5 µg of polyadenylated RNA from peripheral blood macrophages
stimulated with lipopolysaccharide. Lane 5, 1.5 µg of polyadenylated RNA from the
pancreatic carcinoma cell line 1420. Lane 6, 1.5 µg of polyadenylated RNA from
the bladder carcinoma cell line 5637. The positions of 18S and 28S rRNA bands are
indicated.
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LANKS 

1 2 3 4




pairs 



FIG. 5 Hybridization of GM-CSF cDNA to Southern blots of human genomic
DNA. Genomic DNA digested with: Hind III (lane 1), Eco RI (lane 2), Pst I (lane 3),
Bgl II (lane 4).
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